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(57) Abstract 

The invention relates to a nucleotide sequence which comprises a sequence involved in the expression of the human Fatty Acid 
Transport Protein (hFATP) comprising the aminoacid sequence of the Figure. 
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Nucleotide sequence expressing human Fattv Acid Transport Protein and 
corresponding aminoacid sequence. Use for the regulation of fatty acids 
metabolism . 

The invention relates to nucleotide sequences involved in the expression 
of the human Fatty Acid Transport Protein (hFATP) and relates to the Fatty 
Acid Transport protein. 

Fatty acids, especially long-chain fatty acids, are the structural 
components of several classes of lipids; they represent an important energy 
source, particularly with respect to glucose, for various tissues or organs. Fatty 
acids furthermore participate in several cellular signaling processes. 

Excessive intracellular accumulation of fatty acids due either to errors in 
metabolisms or to increased supply can have severe pathological 
consequences. Furthermore, the involvement of fatty acids in several diseases 
such as insulin resistance and coronary artery disease made the inventors 
focuss research efforts on the mechanisms which control the homeostasis of 
these lipid moieties. 

Interestingly, fatty acids are ligands and modulators of transcription 
factors, which control their own metabolism. This emphasizes the fact that the 
various metabolic pathways are not only controlled by substrate supply, but 
also by the participation of substrates and metabolic intermediates in regulatory 
phenomena. 

Intracellular Fatty Acid (FA) concentrations are in part determined by 
proteic regulators, in particular by regulating import/export system that is 
controlled by several proteins including Fatty Acid Transport Protein (FATP) 
and acyl-CoA synthetase (ACS). 

Fatty Acid Transport Protein (FATP) was first isolated in the mouse in 
1994 by a functional cloning approach (Schaffer et al, 1994). 

The inventors have now isolated and characterized the human FATP 
coding sequence and the gene comprising said coding sequence. They have 
furthermore identified the chromosomal location of the FATP gene, in a region 

CONFIRMATION'COPY 
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implicated In several metabolic diseases, suggesting that FATP is part of an 
important group of synthenic genes. hFATP is expressed in several organs 
(heart, brain, liver) and tissues (adipose tissue, muscle) and is especially highly 
expressed in insulin-sensitive tissues. Regulatory studies suggest that its 
expression is under the control of several transcription factors including PPAR 
(Peroxysome Proliferator Activated Receptor) (Martin et a!., 1997). Especially, 
Martin G. et al.. 1997, have shown that the expression of Fatty Acid Transport 
Protein is regulated by PPARa and PPARy activators, such as fibrates or 
antidiabetic thiazolidinedione. The inventors have now determined that the 
expression of the hFATP gene is also under the control of the Retinoid X 
Receptor (RXR) known to transduce the effects of retinoic acid on gene 
expression and involved in the regulation of lipid and glucose metabolism. 

Therefore, FATP is an important target for therapeutic agents used in the 
treatment of several pathological states resulting from anomalies in regulation 
pathways of expression, especially of transcription or translation of the gene 
coding for FATP, either directly or through transcription factors. 

The invention thus relates to genetic means and especially to nucleotide 
sequences involved in the regulation pathway of the long-chain fatty acid 
metabolism and therefore makes available new compounds appropriate for the 
definition of therapeutic means useful for treating pathological disorders related 
to the metabolism of long-chain fatty acids. 

An object of the invention is therefore a nucleotide sequence which 
comprises a sequence involved in the expression, of the human Fatty Acid 
Transport Protein (hFATP) comprising the aminoacid sequence of Figure 2 or 
Figure 5. 

The nucleotide sequence of the invention can be any type of sequence 
including any DNA especially genomic DNA, synthetic DNA, RNA and 
especially mRNA, said sequences being sense or antisense sequences. 

Therefore, the invention relates to these sequences when they are 
independently of the conditions used to obtain them, thus including sequences 
either extracted from a biological sample, or cloned or synthesized including by 
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enzymatic or chemical processes. 

By the expression "a sequence involved in the expression", it is intended 
within the present invention a sequence coding for the Fatty Acid Transport 
protein (FATP) and especially human FATP, or a sequence involved in the 
regulation of the steps required to express the FATP gene, especially the 
hFATP gene, or both sequences operably linked. Said regulation sequences 
can be derived or are derivable from the native genomic sequence of the FATP 
gene, especially hFATP gene. 

Regarding sequence coding for the FATP, especially hFATP. the 
invention pertains to a nucleotide sequence which is capable, when placed 
under the control of appropriate regulation elements, especially regulation 
nucleotide sequences, promoter, enhancer, transcription sites, to be 
transcribed and under appropriate conditions to be translated into an amino 
acid sequence. Said aminoacid sequence can then possibly be processed, 
depending on the expression system chosen. 

Especially, the human Fatty Acid Transport Protein which is expressed is 
characterized by the amino acid sequence that it comprises. Depending from 
the cellular host chosen for its expression, the structure of the obtained hFATP 
can vary, especially as the result of a maturation process, or of environmental 
conditions. 

According to a specific embodiment of the invention, the nucleotide 
sequence comprises a sequence encoding the human FATP, corresponding to 
or comprising the nucleotide sequence having nucleotides 1 to 2222 of one of 
the sequences of figure 1 , figure 2 or figure 3. 

According to a preferred embodiment of the invention, the nucleotide 
sequence encoding the human FATP at least comprises the open-reading 
frame (ORF) corresponding to the human FATP represented on figure 2 or 
figure 5. Alternatively it comprises any fragment of said ORF, coding for a 
polypeptide having the functional properties of the human FATP as far as the 
transport of the long-chain fatty acids is concerned. The coding sequence of the 
human FATP gene is described on figure 1 . figure 2 or figure 3. 
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Such a nucleotide sequence can be placed under the control of the 
native regulation sequences, or part thereof, present in the gene and enabling 
the expression of hFATP or can be placed under the control of heterologous 
regulation sequences according to well-known procedures. 

According to another embodiment, the invention relates to a nucleotide 
sequence hybridizing in stringent conditions, with a probe comprising 50 to 
2000 bp. preferably 50 to 300 nucleotides especially around 200 bp. said probe 
including at least 6. preferably 9 continuous nucleotides from the following 
sequence : CGGGGAGACGGGACGTGAAGGG. 

As an example, a sequence which is advantageously used to provide for 
a probe appropriate to selectively hybridize with the nucleotide sequence of the 
invention comprises at least some nucleotides contained in the 5* non-coding 
sequence upstream from the ATG codon of the coding sequence of the gene 
expressing the hFATP. 

A probe replying to this definition is considered to be specific for the 
human gene encoding the hFATP or derived genomic nucleotide sequences, 
expressing the human FATP. Thus, it does not hybridize with murine, rat or 
yeast cDNA sequences. Such a probe can be an amplification product resulting 
from amplification with primers wherein one primer is specific for the human 
hFATP gene, said primer comprising at least 6, preferably 9 nucleotides 
selected from the above nucleotide fragment. 

The second primer can be taken in any region of the gene and especially 
in the sequence comprising nucleotides 1 to 2222 of the sequence of figure 1 . 
or figure 3. 

The stringent hybridization conditions used according to the invention 
can be defined with respect to the following parameters, referring to 
hybridization of a DNA probe, especially a cDNA probe with a total RNA or 
DNA: 

- hybridization at 42"C. 

- 2 washing steps at 42^*0 for 10 minutes in 0.5 x SSC, followed by 2 
washing steps at 65**C for 30 minutes in 0.5 x SSC. 
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Details regarding these conditions can be found in the examples. 

According to a specific embodiment of the invention, the nucleotide 
sequence is a genomic DNA sequence and especially is the gene encoding the 
human FATP. 

According to another embodiment of the invention, the nucleotide 
sequence comprises a transcription initiation site 61 bp upstream from the ATG 
codon. Especially, the nucleotide sequence comprises the sequence coding for 
the hFATP and the 5' region of the gene containing regulation sequences 
including the transcription initiation site. 

The structure of the sequence of the gene between the ATG and TGA 
codons. can be represented as follows: 

794 
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ATG TGA 
The respective size of each exons illustrated on figure 8 starting from position 1 
at the ATG codon is the following: 



exon 1 (167 pb), exon 2 (395 pb). exon 3 (162 pb). exon 4 (70 pb). exon 5 (92 
pb). exon 6 (111 pb), exon 7 (209 pb), exon 8 (127pb). exon 9 (138 pb), exon 
10 (165 pb), exon 11 (147 pb), exon 12 (158 pb). 

The introns present between said exons, have respectively the following 
5* and 3' ends: intron 2 (5' gtgaggcc.gaccacag 3'); intron 3 (5' 
gtgagtca„.tcctgcag 3'); intron 4 (5* gtgagggg... cccctgcag 3*); intron 5 
(5* gtactacg...ctctgcag 3'). intron 6 (...cgtcccccac 3^); intron 7 
(5* gtgcacacc.cattccag 3'); intron 8 (5* gtgagcag...ctccctag 3*); intron 9 (5* 
gtgcgcag...tctgccag 3'); intron 10 (5' gtcaagct...gcctccag 3*); intron 11 (5* 
gtgcgagt...cactatag 3'). Intronic regions of a hFATP gene according to the 
invention are represented on figure 4 which discloses the genomic sequence 
encoding the human FATP. 

Advantageously, the nucleotide sequence according to the above-given 
definitions comprises further sequences involved in the regulation of the 
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expression of the gene coding for the human FATP. present in the non coding 
5' and/or 3' regions of the gene. 

The sequences involved in the regulation of the expression of the gene 
include sequences containing the promoter region, the transcription initiation 
5 site and other regions involved in the activation of the expression or in 

enhancement thereof. 

Regulation of the expression also involves the regulation of the 
translation of the gene and the corresponding sequences. 

Variants of the above-defined nucleotide sequences comprise a 
,0 sequence vy^hich either specifically hybridizes in high strigency conditions v^ith 
primers having the following sequences primer 1 
AAGGTCAATGAGGACACAATGG (sense), primer 2 

CGAGTAGGTAGTGATCGTGCAG (antisense), or is the amplification product 
obtained with the following sequences, or hybridizes in high stringency 
15 conditions with said amplification product. 

Another variant of the sequence is a genomic sequence coding for 
hFATP and which hybridizes in high stringency conditions with a probe 
containing around 200 pb. said genomic nucleotide sequence preferably 
comprising a sequence involved in the regulation of the gene coding for the 
20 hFATP. 

A specific probe containing around 200 bp which is capable of 
specifically hybridizing with a genomic sequence coding for FATP can for 
example be found in the sequence of exon 2. 

Another nucleotide sequence according to the invention is the mRNA 
25 which is a sequence obtainable by transcription of a genomic nucleotide 
sequence coding for the hFATP and replying to the above definition. 

The mRNA sequence obtainable from the gene coding for the hFATP 
comprises the nucleotide sequence complementary to the sequence of figure 1 . 
or figure 3. having 2 kb. Preferably, it is a sequence including a nucleotide 
30 sequence of 4.4 kb capable of hybridizing in high stringency conditions with the 
sequence of figure 1 . figure 2 or figure 3. 
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The invention also specifically relates to the cDNA as obtained by 
reverse transcription of the mRNA defined above. Such cDNA sequence 
comprises the sequence of figure 1 or figure 3 containing nucleotides 1 to 2222 

or the sequence of figure 2. 

The invention also concerns polynucleotides selected among the 

following sequences; 

- AAGGTCAATGAGGACACAATGG, 

- CGAGTAGGTAGTGATCGTGCAG. 

- a sequence comprising or corresponding to sequences involved in the 
regulation of the expression of the gene encoding the hFATP, 

- any fragment of a nucleotide sequence defined above especially 
derived from the sequences disclosed in figures 1, 3 or 4, for instance by 
deletion mutation or insertion provided the essential biological properties of the 
native sequences are maintained, specific for the hFATP gene, including 
fragments that can be used as primers for amplification reactions, or a probe for 
hybridization. The above cited biological properties are described in the 
examples which follow. 

A further object of the invention is a vector, for the cloning or for the 
expression of a sequence defined above, said vector comprising, inserted in 
site not essential for its replication, a nucleotide sequence as defined above. 

Any appropriate regulation sequence, including heterologous sequences 
with respect to the hFATP gene, and especially nay appropriate promoter, can 
be used for the expression of the nucleotide sequence of the invention cloned 
into the vector. The selection of these regulation sequences can depend upon 
the cell host which is used to carry out this expression. 

Especially, the vector is a plasmid or a phage. Advantageously, it further 
comprises a reporter sequence such as the CAT gene or the sequence of the 
luciferase gene. 

The sequence of the reporter gene which is present in the vector can be 
under the control of an heterologous regulator region especially under the 
control of the regulation sequences of the gene encoding the hFATP. 
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The invention also relates to recombinant ceils especially procaryotic 
cells or eucaryotic cells and advantageously insect or mannnaal cells. 

The invention is also directed to the human Fatty Acid Transport Protein 
comprising the amino acid sequence of figure 2, or figure 5 or to any 
polypeptide fragment having the properties of FATP regarding intracellular 
transport of fatty acids, or to fragments recognized by antibodies directed 
against the human FATP. 

The hFATP has a calculated molecular weight of 71 kDa and an 
apparent molecular v\reight of 63 kDa in Northern Blot analysis. 

According to a specific embodiment of the invention, the hFATP has an 
isoelectric point of 8.5 and replies to the amino acid sequence of figure 2 or 
figure 5. According to another embodiment, this sequence is encoded by a 
nucleotide sequence as defined above. 

Advantageously, said human hFATP is processed and especially devoid 

of its signal peptide. 

The hFATP of the invention can be either under its glycosylated form or 
can be devoid of its glycosylation groups. 

The invention also relates to antibodies specifically directed against 

hFATP. 

In view of the various properties which have been identified with respect 
to the FATP in the regulation pathway of the long-chain fatty acid, the invention 
provides means that can be used in therapeutic compositions especially for 
modulation of the intracellular level of long-chain fatty acids, which comphses a 
component capable of regulating the expression of the gene encoding hFATP. 

Alternatively, the composition of the invention can be used for the 
therapeutic modulation of the blood level of long-chain fatty acids. 

The inventors have especially found out that the involvement of human 
FATP in the modulation of the metabolism of fatty acids, especially long-chain 
fatty acids, can provide a way to treat pathological states related to various 
diseases and for example to obesity, cardiomyopathies and diabetes, 
especially diabetes non-insulino dependent. 
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According to the invention, the expression "treatment" relates to the 
capacity of a composition or compound, to prevent the occurrence of a 
pathological state or to control such pathological state or to improve the 
condition of a patient suffering from a pathological state, where this pathological 
state is the result or involves an abnormal regulation of the long-chain fatty 
acids metabolism. 

Especially, the invention relates to such a composition which comprises 
an inhibitor of the expression of the FATP gene, for instance for the treatment 
of obesity or related diseases. 

Alternatively, the invention relates to a composition which is capable of 
enhancing the expression of the hFATP gene for the treatment of 
cardiomyopathies, diabetes or related diseases. 

The invention further provides means enabling assaying the capacity of 
chosen compounds to have an agonist or antagonist activity with respect to the 
expression of FATP. especially human FATP, in various tissues. Such assay 
can hence comprise the steps of: 

- contacting a culture of cells of a specific tissue, said cells containing the 
hFATP gene, with a tested compound; 

- detecting the effect of said compound on the expression of the FATP 

gene. 

The invention also relates to a method for the screening of the 
expression of the hFATP protein in determined cells when these cells have 
been contacted with a determined compound, which method comprises the 
step of detecting the transcription of the mRNA in the cells or cell extracts. 

In a specific embodiment of the invention, the screening method 
comprises 

a) measuring the level of transcription of the mRNA in cells or cell 
extracts, wherein the cells have previously been contacted with the determined 
compound, in conditions enabling the interaction of said cells and said 
determined compound; 

b) measuring the level of transcription of the mRNA in the same cells 
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species as in step a) or on extracts of these cells, wherein these cells have not 
been previously contacted with the assayed compound; 

c) comparing the level of transcription obtained in steps a) and b). 

Preferably the cells which are used to carry out the above screening 
5 methods are selected among the group of live cells, heart cells, adipose tissue 
cells, skeletal muscle ceils. The above defined methods can allow the 
identification of agonists or antagonists of the expression of the hFATP gene. 

Specific elements of the invention are further disclosed in the examples 
and in the figures: 

10 Figure 1: nucleotide sequence of clones containing 2 kb sequences obtained 
from a cDNA library of human adipose tissue. The sequence on the upper line 
corresponds to the sequence encoding the hFATP protein and the sequence on 
the bottom line corresponds to the alternative foim of splicing found in the 
analyzed clones 



Figure 2: Alignment of the nucleotide sequences of hFATPI, mFATP 
(MMU15976) and rFATP (RNU89529) and of the primary amino acid sequence, 
using the J. Hein method. The glycosylation sites are indicated by an asterisk 
and are boxed. The sequence of 11 amino acids common to the members of 
20 the family of AMP-binding proteins is boxed on the alignments. 

□ Amino acid different from the murine sequence 



Consensus sequence of 1 1 amino acids characterizing the family of 
AMP-binding proteins 

25 D Potential glycosylation site. 

Figure 3: a variant of the nucleotide sequence of clones encoding the human 
FATP protein and obtained from a cDNA library of human adipose tissue. 

30 Figure 4: sequence of the genomic sequence expressing the human FATP 



15 
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protein, including the introns (lower case) and the exons (upper case). 

Figure 5: The amino acid sequence encoded by the nucleotide sequence of 
figure 3 and corresponding to the human FATP protein. 

Figure 6: Comparative analysis based on the Garnier-Robson, Chou-Fasman 
and Kyte-Doolittle algorithms of the murine and human FATP proteins. 

Figure 7: Primer extension with the reverse GM6 primer on 10 pg of human 
visceral adipose tissue total RNA. 

Figure 8: Provisional genomic structure of the hFATP gene. A. Genomic 
structure of hFATP with its 12 exons and 11 introns. The numbers in the boxes 
surrounding the exons indicate the exon numbers. The numbers above the 
boxes indicate the position of the introns. B. The approximate size of introns 2 
to 1 1 (bp) is indicated between the dashes separating the two ends of the 
donor and of the recipient. Intron 6 was identified by sequencing the PAC 
clone. C. Alternative splicing leading to hFATPI or its alternative splicing 
product hFATP. The relevant intron is intron 1 , hFATPI is normally spliced as in 
mice. The splicing of the variant of hFATPI resumes beyond the intron in the 
coding sequence but changes the resading frame. 

Figure 9. The 19p13 region. The arrow indicates the position of hFATP on 
chromosome 19, the shaded zone represents the frequency of the genes 
encountered in the different regions of the chromosome. The arm p of the 
chromposome is represented at the top. The region q is represented at the 
bottom. 

Figure 10. A. Dose response of the effets of 9c-RA on FATP and ACS gene 
expression on FAO ceils 

Cells were kept 18 hr in serum-free medium and treated for 6 hr with increasing 
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concentrations of 9c-RA (10'^ M to 10"* M). Control cells were incubated with 
BSA and vehicle. 30 of total RNA were loaded per lane. Northern blot 
analysis was carried out and blots were hybridized with FATP, ACS. and actin 
probes. Quantification was perfornned with an imaging densitometer (Biorad 
5 GS-670). Values were normalized by comparison with actin control probe and 
results are expressed as percent of control signal, relative arbitrary unit 



Figure 10.B. Dose response of the effects of 9c-RA and at-RA on FATP gene 
10 expression on Hep-G2 cells. 

Cells were kept 18 hr in serum-free medium and treated for 6 hr with (10 M to 
10"^ M) either 9c-RA and at-RA. Controls were incubated with BSA and vehicle. 
Northern blot analysis was carried out and blots were hybridized with an FATP 
and actin probe. 



Figure 1 1. A. Dose response of 9c RA on differentiated 3T3-L1 cells. 
Differentiated 3T3-L1 cells were treated 24 hr with different concentrations 9c- 
RA (O-IO'^M). Northern blot analysis was carried out and blots were hybridized 
with an FATP and ACS probe. Actin was used as control probe. A means 
20 differentiated. 

Figure 11.B. Comparative induction of FATP and ACS in FAO and Caco2 cells. 
FAO(A) and CaCo2 (B) cells were serum deprived 18 hr and then treated 
during 6 hr with 9c-RA (10'^ M). Northern blot analysis was carried out as 
25 described and blots were hybridized with an FATP, ACS and actin probe. 

Figure 12. Nuclear run on analysis of the effect of 9 cisRA on FAO and 3T3-L1 
differentiated cells. 

FAO cells were treated for 2 h with 9c-RA (10"^ M) after 18 h of growth in serum 
30 deprived conditions. Differentiated 3T3-L1 cells were treated dunng 12 h with 
the same concentration of 9c-RA. Transcription rates were then determined for 



(R.A.U.). 
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the FATP. ACS and GAPDH genes in from control cells (-RA) or 9c-RA treated 
cells (+RA). A Bluescript (BS) template was used as a control. Densitonnetric 
scanning of the results is depicted at the right panel. 

5 Figures 13, 14 and 15 

Dose-response of 9c-RA. at-RA and TTNPB and: in non-differentiated and 
differentiated 3T3-L1 cells. 

Differentiated (3T3-L1 A) or non-differentiated (3T3-L1) cells were treated 24 hr 
with each of the retinoids indicated. Northern blot analysis was carried out and 
10 blots were hybridized with an FATP (figure 13). ACS (figure 14) and LPL (figure 
15) probe. Values were normalized by comparison with actin control probe and 
results are expressed in percent of control (R.A.U.). 

Figure 16. Oleate uptake assay on 3T3-L1 differentiated cells. 
\5 Differentiated cells 3T3-L1 (3T3-L1 A) were treated for 24 hr with different 
concentrations of TTNPB (10"® M to 10'^ M) and ^"^C oleate uptake studies were 
carried out. Results were normalized after protein quantification and expressed 
as % of control. Significant differences were indicated by an asterisk * P<0,05. 

20 Figure 17. Tissue expression of FATP in different human tissues. The human 
probe of the B-actin has enabled checking the regularity of the depots. The 
additional band obtained with this probe represents the actin specific of the 
muscle which is thus highly expressed in muscle and brain. 

25 Figure 18. Expression of FATP in different human cell lines and comparison 
with the NCI adipose tissue: surrenal cortex carcinoma cells, THP1: monocytes. 
THP1 diff: THP1 differenciated with PMA. Caco2: cells of colon 
adenocarcinoma, HepG2: liver hepatoma cells, Hep3B: hepatocytes. JEG-3: 
chorion carcinoma cells. 

30 

Figure 19. Distribution of FATP and LPL in muscular skeletal tissue, adipose 
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tissue, liver, colon, and intestine tissues in human. Sc: sub-cutaneous, Vis: 
visceral. 

Exempte 1 

5 

1. MATERIALS AND METHODS 
1. Materials 

BRL 49653 and fenofibric acid were provided by Janssen Research 
Foundation^ in Beerse, in Belgium, by the Foumier laboratories, in Daix, in 
10 France and by Ligand Pharmaceuticals, in San Diego, in the United States. The 
retinoic acids 2-£/s-fef/no/c acid (9cRA), ofl-trans-retinoic acid (at-RA). were 
provided by Ligand Pharmaceuticals. All the other products, unless othenwise 
stated, were obtained from Sigma (St Louis, MO). 

15 2. Human biopsies 

The various omental and subcutaneous human adipose tissues were 
collected from normal or obese subjects during plastic or reconstructive 
operations. The muscular tissues were collected from patients who had 
undergone an operation in the hip region. They were the liver or skeleton 

20 muscle tissue. All the tissues were immediately frozen in liquid nitrogen. 

3. Animals and treatments 

Male rats of the Wistar strain were treated for various lengths of time with 
fenofibrate mixed with the feed in powdered form (weight/weight) or BRL 49653 

25 by gavage. at the concentrations indicated. The weight of the animals and the 
intake of feed were recorded daily for the 7-day treatments and every two days 
for the 14-day treatments. The treatments with the fibrates did not cause 
significant changes in the quantity of fee consumed by the animals. Since each 
rat consumes approximately 20 grams of feed per day, the doses of 0.5, 0.05 

30 and 0.005% (weight/weight) of fenofibrate correspond to 320. 32 and 3,2 mg/kg 
of body weight/day. At the end of the treatments, the rats were sacrificed by 




wo 99/5 1 740 PCT/EP99/02295 

15 

exsanguination under ether anaesthetic. The triglyceride and cholesterol levels 
were measured using a colorimetric detection kit (Boehringer-Mannheim). The 
various tissues were collected, rinsed in 0.9% NaCI and imnnediately frozen in 
liquid nitrogen. The liver and the epididymal adipose tissue were weighed. The 
5 effects of starvation were studied on rats from which food had been withdrawn 
14 hours before being sacrificed. 

4. Cell culture 

The cells of the different hepatic lines used were: Fa 32, a rat hepatoma 

10 cell line derived from Faza967 (Deschatrette et al.. 1974), the mouse AML-12 
hepatocytes (Wu et ai.. 1994), the human hepatoma line HepG2 and the FAO 
cells, a well differentiated subclone of the rat hepatoma line H4 II ECS. All the 
cells were maintained in Dulbecco's modified Eagle's minimal essential medium 
(DMEM). supplemented with 10% decomplementized foetal calf serum, except 

15 AML-12 and FAO. The AML-12 cells were maintained in DMEM/Ham's F-12 
medium supplemented with insulin, transferrin and selenium (ITS, Collaborative 
Research), and dexamethasone (0.1 pM). The FAO cells were cultured in 
DMEM/Ham's F-12 supplemented with 10% decomplementized serum, 
penicillin (200 lU/mi) and streptomycin (50 mg/ml). 

20 The mouse preadipocyte cell lines obi 771 (Negrel et al., 1978) and 3T3- 

L1 (ATCC) were maintainted in DMEM medium with 10% lipid-free and 
decomplementized foetal calf serum. The 3T3-L1 cells were differentiated by a 
two-day treatment with dexamethasone (0.1 |jM), isobutylmethylxanthine (0.25 
mM) and insulin (0.4 pM). These cells, following initiation of differentiation, were 

25 maintained for a further 8 days with insulin until their complete differentiation 
was obtained. 

The L6 muscle cells were cultured in DMEM with 10% foetal calf serum 
up to confluence followed by spontaneous differentiation. 

The Caco2 colon carcinoma cells (ATCC) were cultured in DMEM 
30 medium and 15% decomplementized foetal calf serum as well as antibiotics 
and non-essential amino acids. Above 80% confluence, the cells differentiate 
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spontaneously for 20 days. 
5. RNA analysis 

5.1. Extraction of RNA and quantitative analysis 

The RNAs of the tissues and cells were prepared by the technique using 
caesium chloride and guanidine isothiocyanate (Chomeczynski et aL. 1987). 
For the adipose tissue, a first step of centrifugation at 4°C of the 
honnogenization product was added in order to remove the lipids, which are 
solidified at the surface, and to continue the extraction under the best 
conditions. The dot-blot and Northem-bfot hybridizations of the total or 
messenger RNAs after separation on a poly A column (Stratagene) were 
carried out as described in Auwerx et ai. (Auwerx et al., 1988). Before being 
deposited, the RNAs were assayed in duplicate by spectrophotometry 
(Pharmacia Ultrospec 2000), the difference tolerated between the values was 
less than 5%. The various messengers were measured using cDNA fragments 
obtained by reverse transcription, rt-PCR or by restriction, as probes. The 
probes were labelled with a random primed fabefling kit (Boehringer-Mannheim) 
and a-^^P]dCTP (NEN, Boston, MA): The filters (nylon and nitrocellulose. Pall 
Filtron) were hybridized overnight at 42^0 with 10^ cpm/ml. The washes were 
carried out for 10 minutes at 42°C and for twice 30 minutes at 65°C in a 0.5 X 
SSC buffer and 0.1 % SDS. Next, these blots were exposed to 
autoradiographic films (X-Omat AR or Biomax MS, Kodak). The autoradiograms 
were analysed by densitometric scanning (Biorad GS 670 densitometer). 

5.2 Measurement of the transcriptional activity by run on or transcription in vitro 
The nuclei of the cells and of the tissues were prepared according to the 
Nevins technique (Nevins. 1987). The total RNAs labelled with a-^^P]UTP 
(3000 Ci/mmol) (NEN, Boston, MA), in quantity of equivalent radioactivity, were 
hybridized on membranes on which there had been deposited 5 pg of the 
different plasmids or 1 ^g of the restriction fragments of the genes of interest. 
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5.3. Primer extension 

The location of the site of initiation of hFATP was determined as 
described in Sannbrook et al. (Sambrook et al.. 1989). An antisens 
oligonucleotide, starting at +6 bases from the site of initiation of translation, was 
used 3*-CCC GCA TCC CTT CAC GTC CCG TCT CCC-5'. 10 pg of total RNA 
of human adipose tissue were precipitated with 500,000 cpm of primer labelled 
in 5* using T4 polynucleotide kinase (Amersham, Courtaboeuf. France with y- 
^^P]ATP. denatured for 5 minutes at 95 °C and annealed for 90 minutes at 65**C 
in a hybridization solution. The extension was carried out at 42°C for one hour 
with 50 U AMV-RT {Life Technologies, Paisley, GB) and 100 U MMLV-RT {Life 
Technologies, Paisley. GB). A control sequence reaction was used as standard 
mass marker to locate the 5* end of the extension product. 

6. DNA techniques 

6.1. Screening of complementary DNA libraries and of genomic DNA of phages 
Three libraries were used to isolate the FATP gene, including a 
commercial cDNA library {Human fat cell S'-stretch plus cDNA library HL3016b, 
oligod'1 -primed, Clontech laboratories, Inc. USA). After titration of the libraries, 
10® plaque forming units (PFU) served for the infection of 6 ml of the respective 
competent bacteria (Y1090, XL-1 and SRB) OD6oo=1. maintained in Luria 
Bertani medium with 0.2% of maltose and 10 mM MgS04. The adhesion of the 
phages to the bacterial walls was carried out for 15 minutes at 37°C in 14-ml 
polypropylene tubes. The mixture was then spread with TOP agar {NZ amine, 
casein hydrolysate, yeast extract (NZY) + 0.7% agarose) on 10 culture dishes 
15 cm in diameter. The lysis was carried out overnight at 37°C. Two blottings 
on nitrocellulose filters (NEN, Boston, MA) were carried out on each dish, taking 
care to note the orientation of the dishes. The filters were then treated in a 
denaturing, neutralizing and washing solution, and then fixed under UV or with 
heat. The hybridization, washing and exposure conditions were the same as 
those used for the RNA techniques. In each screening, a double-stranded 
probe was used, derived from rt-PCR or from restriction. The probe was for 
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example a 652 pb fragment, amplified with the primer sense GM1 (ou 382) (5'- 
ATG CGG GOT CCT GGA GCA GGA ACA -3') and antisense GM2 (ou 399) 
(3'-CTG CGT GTC AGG CAG GAT GCT TCT AGG CCC-5^) covering the 5* end 
of mFATP cDNA. The clones isolated from a first screening were then purified 
by two additional screenings. A final step consisted in amplifying the titre of the 
phage clone isolated. 

6.2. Subcloning of the hFATP phage clones 

The phage DNA was then prepared according to the Sambrook 
technique (Sambrook et al., 1989) or amplified by PGR with a high Ftdelity 
polymerase (Boehringer-Mannheim), and then mapped and excised at the level 
of the cloning site in the phage in order to then introduce it into a blue^cript 
(BS) type vector, pBS-KS. The inserts could thus be amplified, sequenced, 
mapped and manipulated more easily. 

6.3. Sequencing and alignment of sequences 

The first clones were sequenced manually with the T7 sequencing kit 
(Pharmacia) or on a PGR product after treating the inserts with a phosphatase 
and an exonuclease I in order to remove the primers and the dinucleotides 
remaining whch interfere with the sequencing reaction. The product of 
sequencing was then deposited on a 6 or 7% acrylamide gel, the migration 
lasted 3-4 h at 60 mA. The sequence was read on the autoradiogram and could 
be as high as 200 bp. 

The sequences were then prepared on an automatic sequencer (ABI 
377, Perkin Elmer) with a PGR sequencing reaction using fluorescent 
terminators (ABI Prism big dye terminator, cycle sequencing ready reaction kit. 
Perkin Elmer Biosystem), the reading is automatic and may be as high as 700 
bp. The sequence alignments and searches were carried out in a first instance 
with Genbank by means of the Lasergene Navigator software (DNA Star) and 
then In a second stage using the Internet on a national center for biotechnology 
information (NCBI) server. The search for homology for the cDNA fragments 
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was carried out using the Blast search programme. The online mendelian 
inheritance in man (OMIM) searches made it possible to identify the genes 
located on the same chromosome as hFATP. 

6.4. Search for introns 

The search for the intron/exon was carried out by PCR amplification on 
genomic DNA by comparison with amplifications on cDNA (phage clone, 
plasmid) or rt-PCR. Direct sequencing on plasmid artificial chromosome (PAC) 
(Genome systems Inc., St. Louis, USA) were undertaken. The junctions were 
identified after alignment of the sequences. 

7. Measurement of the transport of oleate 

7.1. Preparation of the fatty acids 

The labeled fatty acids (^"^C oleate. 40-60 mCi/mmol. NENE) are 
incubated in the presence of albumin in a ratio of between 0.5 and 2 for 45 
minutes at 37''C in 1X Hank's solution. 

7.2. Measurement of transport 

The cells treated with the various activators are rinsed with a Hank's 
solution and then incubated for 1 h with serum-free and glucose-free medium. 
The cells were again rinsed with a 1X Hank's solution with 0.2% of bovine 
serum albumin (BSA) at 37"C and then at 23X, the transport being measured 
at room temperature. The incubation of the cells with labeled oleate lasts for 1 
minute, the stop solution was an ice-cold Hank's solution. The cells were lysed 
with 0.1% SDS, the proteins were assayed on 20mI. the remainder was 
supplemented with 4 mi of scintillation liquid and counted. 

8. Chromosomal location 

The FATP probe was labelled with digoxigenine-1 1-dUTP by nick 
translation (Boehringer-Mannheim). The probe was purified and concentrated 
by precipitation in order to obtain a concentration of 50 ng/^l. The chromosomal 
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preparation was obtained from T lymphocytes stinnuiated with 
phytohaemaglutinin. The cells thus stimulated were synchronized after 48 h of 
culture by the action of methotrexate for 17 h and then the cell cycle was 
unblocked by the addition of thymidine. After 4h 30 min of action, the cells were 
blocked in the metaphase by the action of colcmid. After a hypotonic shock and 
fixing, the preparation was deposited on a slide; to obtain good precision, the 
techniques of G bands obtained by the action of trypsin and after Giemsa 
staining (GTG banding) (Francke et al.. 1978) and the FISH technique were 
combined. The metaphases were then selected on a DMRXA microscope 
(Leica). The were karyotyped and stored in a software for image analysis 
(CHANTAL) developed by Leica. After non-isotopic in situ hybridization (Geffroy 
et al., 1995) with the FATP probe and detection, the metaphases were 
examined with the aid of a fluorescence microscope (Leica type, DMRXA) 
combined with a Leica image analyzing system. The software allowed 
superposition of the metaphase in the G band and in FISH. 

2. ISOLATION AND CHARACTERIZATION OF THE HUMAN FATP GENE 
1. Results 

1.1. Isolation of the complete cDNA from the human gene for FATP 
1.1.1. Purification of the hFATP clones 

Several human adipose tissue cDNA libraries, (including one commercial 
human fat cell 5* stretch plus cDNA library HL3046b. oligodT primed, Clontech 
library) were screened with a murine probe in order to isolate various cDNA 
clones of the human FATP gene. The first screening was carried out on the 
commercially made human adipose tissue cDNA library (Clontech). The probe 
used was a 5* fragment of 654 bp (position 1-654) amplified by rt-PCR on 
mouse adipose tissue RNA with primers 382 (5'- ATG CGG GOT OCT GGA 
GCA GGA ACA-3*) and 399 (3*-CTG CGT GTC AGG CAG GAT GOT TCT AGG 
CCC-5') or alternatively (GTG TCA GGC AGG CAG GAT GOT CTC). This 
probe was inserted into a bluescript plasmid, pBS-KS. at the level of the EcoRV 
site and the sequence of this plasmid, mFATP399, revealed 100% homology 
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with the murine sequence. The screening provided 23 positive clones from 10 
PFU, the probability of isolating a gene having a normal frequency being 1 out 
of 1 million. Next, a second screening was undertaken on another library under 
the same conditions for titration and spreading of the phages and for 
hybridization. This screening provided 6 clones which were identified as 
containing FATP. These clones were purified over three successive screenings. 

The cDNA inserts, excised after digestion with EcoRI, provided 
fragments in a size interval from 300 bp to 2 kb. These clones were amplified 
by PGR using the lambda phage gt11 primers {X gt11 FoPA/ard and Pi gt 11 
Reverse), The sequencing of these PGR products allowed us to differentiate 1) 
the clones whose DNA sequences were homologous to mFATP\ they are the 
partial hFATP clones. 2) the clones whose sequence was overlapping or even 
identical and 3) the clones whose DNA sequence did not align with mFATP. 
The latter category comprises false-positives which proved positive in 
hybridization because of homology with the probe used but whose sequence c 
differentiates them from hFATP, They may be genes already identified or open 
reading frames (ORF) not yet identified. 

Three large inserts, homologous to mFATP (1.6 kb clone 2.a, 2 kb. clone 
3.e and 1 kb. clone 3.g starting at nucleotide +84 following the ATG codon), 
were subcloned to a Bluescript vector at the level of the EcoRI site and 
completely sequenced over the sense and antisense part. These fragments 
were then aligned with the mouse sequence which was. with the rat sequence 
rFATP, highly homologous, the only two sequences known from 1994 to the 
end of 1997. The 1.6 kb clone aligned from position 487 on mice up to position 
1941 which con-esponds to the TGA codon. that is to say the end of the murine 
coding part. This clone therefore had an additional 3' part (147 bp) which was 
identified by rt-PCR as the 3' untranslated transcriptional region (3*UTR) and of 
the FATP gene, a region which is transcribed but not translated. 

The 2 kb clones appeared to be identical during total amplification of 
these 2 kb with the XgtllF and XgtllR primers by PGR. The sequencing of 
these clones made it possible to differentiate them: hFATPI and an alternative 
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form of splicing. The open reading frame of this clone is 1941 bp for hFATPI, 
that is to say identical to the murine coding sequence of 1941 bp. Its alternative 
form of splicing had an 82-base deletion compared with hFATPI, This deletion 
is situated from position 168 to 249 on the mouse sequence or the hFATPI 
sequence. This deletion could result from an alternative splicing, which was 
thus analyzed with the genomic stnjcture. These two clones had 22 bases of 
5*UTR and 265 bases of 3*UTR. 

1.1.2. Characteristics of the clones 

A search in the NCBI library revealed 84% similarity at the nucleotide 
level between hFATPI and MMU15976 (murine FATP sequence identification 
name) and 83% between hFATPI and RNU89529 (ra\ FATP sequence 
identification name). The murine FATP sequence exhibits 93% at the 
nucleotide level with the rFATP sequence. A search for sequence homology 
with the entire library of genes listed by NCBI was carried out with the Blast 
search program. This analysis revealed 57% homology at the nucleotide level 
with the cDNA for rat acyl coenzyme A synthetase (D85100) with an alignment 
over 351 bases, 58% with the murine homologue of ACS (AF33031) with an 
alignment over 348 bases. The hFATPI sequence also exhibited 94% identity 
over 18 bases (D88308) and 56% over 210 bases with the human homologue 
of ACS. The center of this region of homology is indicated in Figure 2. 

At the amino acid level, the protein exhibits 89% homology with the 
murine protein, mPATP, and 89% with the rat protein rFATP. rFATP and 
mFATP have 93% homology. Yeast Fat IP had. for its part, only 54% 
homology with mFATP. 

The hFATP protein comprises 646 amino acids, its calculated molecular 
weight is 71 kDa and its isoelectric point is 8.5. We compared the amino acid 
sequence of hFATPI and from its alternative splicing form. Said alternative 
from encodes a protein of 57 amino acids and ends with an ambre mutation 
which stops translation. By observing the translation in the three phases of the 
complete sequence, we noticed that the sequence following this termination 
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site aligned farther away in another reading frame with hFATPI. Such a 
phenomenon can only be due to a splicing error which can be confirmed by 
analyzing the genomic structure. 

Analysis of the composition of the protein by the DNA Star software 

5 shows 40% of hydrophobic amino acids, which is in agreement with the fact 
that FATP is a transmembrane protein. hFATP has a signal sequence of 30 
amino acids (von Heijne, 1986). The amino acid sequence of hFATP shows 
three potential glycosylation sites (N-X-S/N-X-T), these sites are consen/ed in 
mice, rats and humans. Only one of these sites is identical to the yeast Fat 1P 

10 protein whose amino acid sequence was published in 1997 (Faergeman et al., 
1997). In a very advantageous manner, a sequence of eleven amino acids is 
perfectly identical and conserved between hFATP, mFATP, rFATP. Fat 1 and 
rACS. This oligopeptide (lYTSGTTGLPK) is common to members of the family 
of AMP-binding proteins. On the basis of this sequence conservation, the 

15 common evolutionary line for the FATP. Fat 1P and ACS genes can be 
confirmed. 

The structural units characterizing the secondary structure of the FATP 
protein was studied by comparing its protein analysis using the protean 
program of the DNA Star software (Figure 6). 

20 Using the kyte and Doolittle algorithms, the hydrophilicity profiles of 

hFATP and mFATP was compared (kyte et al.. 1982). An analysis of the 
primary amino acid sequence based on the Chou-Fasman calculations was 
then carried out in order to locate the a and p sheets, and the bent and helical 
regions (Chou et al.. 1974). This study was complete with a Garnier-Robson 

25 analysis locating the same regions but with a different method (Gamier et al.. 
1978). These analyses demonstrated that these proteins exhibited comparable 
profiles. 



destabilizing consensus sequence (ATTTA), which suggests that the complete 
30 message is much longer than 2 Kb. This hypothesis was confirmed further to a 
commercial Northern-blot hybridization with a human FATP probe. The size of 



The 3*UTR region does not contain putative polyadenylation sites or 
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the messenger is approximately 4.4 kb. 

1 .1 .3. Identification of the site of initiation of transcription 

In order to identify the site of initiation of transcription, a 5' extension. 
primer extension, with an antisense primer (reverse GM6) positioned in 5' of the 
coding region was carried out. The number of bases between the primer and 
the extension product was 34 bases, the primer was 27 bases long, which 
locates the site of initiation 61 bases upstream of the ATG. This distance is 
relatively short compared with the average distances of the sites of initiation 
which, in addition, may be highly variable. The location was checked with a 
[apid amplification of qDNA Qnds (RACE) technique. 

1.2. Isolation of the human FATP gene and genomic characterization 
1.2.1. Isolation of the human genomic DNA clone 

In the second instance, a human genomic DNA phage library was 
hybridized with a human probe and 6 genomic clones were isolated from 10^ 
PFU. The genomic characterization of human FATP was carried out by CPR on 
the genomic clones and total genomic DNA. 

A PAC clone, derived from a plasmid capable of inserting fragments up 
to 120 kb was used. This clone was obtained using two primers, the first 
located from nucleotide 1267 to 1288 (GM3) and the second from nucleotide 
1495 to 174 or more preferably from nucleotide 1469 to 1489 (GM4) in the 
sequence. The screening was carried out by PGR on human genomic DNA. 
The determination of the introns on this PAC was carried out by automatic 
sequencing using a sequencing technique developed for long DNA fragments 
(Fajas et aL, 1997). Unlike the genomic clones previously isolated, the 
PAC 14957 clone did not contain the pseudogene. Indeed, the primers used 
were intended to amplify the region which contained an intron, which made it 
possible to avoid again isolating the pseudogene. For the remainder of the 
characterization, this PAC14957 clone and the human total genomic DNA being 
checked were used. 
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1.2.2. Structure of the gene 

On this PAC clone, the introns situated at the 3* end of the FATP gene 
up to position 997 were identified. Beyond this region, towards the 5' end, no 
alignment could be achieved on more than 3000 bp sequenced on a PAC 
14957 clone by the technique of DNA walking towards the 5* region. 
Furthermore, this PAC clone did not appear to hybridize with oligonucleotides 
having the 5' part of the cDNA. In order to determine the positions of the other 
introns, a PCR strategy was applied to human genomic DNA in comparison 
with the rt-PCRs carried out on the human adipose tissue RNA (Figure 8). In 
order, the size of the exons is: 167, 395, 162, 70. 92, 111, 209, 127, 138, 165, 
147 and 158 bp. 

1.3. Chromosomal location of FATP 

With the aim of determining the chromosomal location of hFATP, an in 
situ hybridization technique was used. Twenty interpretable metaphases, 
following the chromosomal preparation of T lymphocytes, were selected for 
hybridization with the hFATP probe. The in situ hybridization with the 
PAC14957 probe, a genomic clone of hFATP, labelled with digoxigenin, was 
located on chromosome 19. more precisely 19p13.1. 

This region is very rich and 126 genes or locus responsible for disease 
were located in this region. Among the genes of greatest interest, at the level of 
the lipid and carbohydrate metabolism, situated in the same 19p13 region, 
there are: the familial hypercholesterolaemic locus, the insulin receptor, the 
atherogenic lipoprotein phenotype, apolipoprotein C-l and the locus involved in 
apo C-ll deficiency. 

2. Discussion 

The human homologue of mFATP was isolated by screening a human 
adipose tissue cDNA library. Two messengers were differentiated: tiFATPI 
which encodes a protein of 646 amino acids and hFATPZ, the truncated form of 
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hFATPI. which encodes a protein of 57 amino acids. The functionality of 
hFATP2 is very unlikely and is derived from an alternative splicing. hFATPI 
comprises 12 exons and 11 introns. 

The hFATPI protein is hydrophobic in nature and its protein profile is 
5 very similar to that of mPATP. hFATPI is a hydrophobic, transmembrane 
protein with three glycosylation sites and a potential signal sequence of 30 
amino acids. Analysis of sequence homologies with other proteins made it 
possible to reveal 1 1 amino acids which are perfectly identical between ACS 
and FATP. These amino acids would constitute a binding site common to these 

10 two proteins. Shaffer and Lodish had already suggested that FATP and ACS 
could act in concert; the inventors have now shown analysis of gene regulation 
that FATP and ACS are regulated in a coordinated manner in tissues sensitive 
to insulin. FATP is thought to allow the entry of fatty acids into the cell and ACS 
is thought to convert them to active metabolic compounds, the acyl CoAs. for 

15 storing energy as a reserve or for the production of energy. 

The location of F>4TPgene on chromosome 19, more precisely 19p13.1. 
is of interest since this region comprises numerous genes involved in the 
metabolism of fatty acids: the locus for apo C-ll deficiency, the metabolism of 
glucose, the insulin receptor and the metabolism of cholesterol with the locus 

20 for familial hypercholesterolaemia. FATP is thought to form part of a syntenic 
group with one of these other genes and to segregate in the same manner. 
Thus, ap C-ll deficiency causes a decrease in the LPL activity and is 
accompanied by accumulation of particles high in triglycerides. The insulin 
receptor has been very widely studied in order to search for the causes of 

25 insulin-dependent diabetes, an abnormality in this receptor prevents the action 
of endogenous and exogenous insulin. Familial hypercholesterolaemia due to a 
mutation in the receptor for the LDL particles causes the development of 
atherosclerosis and of cardiovascular diseases. All these manifestations have 
in common an accumulation of lipids in the bloodstream which may lead to 

30 insulin resistance. 
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Example 2 

INDUCTION OF THE FATP GENE BY DIMER SELECTIVE RETINOIDS 
SUGGESTS THAT PPAR-RXR HETERODIMER IS ITS MOLECULAR 
TARGET. 

Free fatty acids can be released from adipocytes by the hormone- 
sensitive lipase (HSL) or from tripglycerides-rich lipoproteins by lipoprotein 
lipase (LPL). Circulating fatty acids can then cross the plasma membrane either 
by virtue of their lipid solubility (Higgins, 1994) or be taken up by cells in a 
process mediated by the fatty acid transport protein (FATP). A second protein 
found to be involved in the process of long-chain fatty acid uptake is the Acyl 
Coenzyme A synthetase (ACS). FATP acts as a transporter of fatty acids, 
whereas the role of ACS is rather confined in preventing the efflux of fatty acids 
through an esterification process. 

FATP and ACS mRNA levels are regulated in a tissue-specific manner 
by peroxysome proliferator-activated receptors (PPARs) (Martin et aL, 1997). 
PPARs are members of the nuclear receptor gene superfamily. Expression of 
LPL, FATP and ACS is known to be under the control of hypolipidemic and 
hypoglycemic drugs, an effect mediated by PPARs. Three retinoic acid 
receptors, termed RARa, -p and -y, and three retinoid X receptor, designated 
RXRa, -p and -y, are classically thought to transduce the effects of retinoic acid 
(RA) on gene expression. Both 9-c/s RA (9c-RA) and all trans-RA (at-RA) can 
directly bind and activate FlARs. whereas RXR doesn't bind at-RA, but binds 
9c-RA. RXR-PPAR heterodimers respond to both RXR and PPAR ligands 
(Kliewer et al.. 1992). 

RXR agonists function as insulin sensitizers and have beneficial effects 
on hypertiglyceridemia, hyperglycemia and hyperinsulinemia in mouse models 
of NIDDM and obesity (Mukherjee et al.. 1997). This effect is not secondary to a 
variation in the weight of the animals. Furthermore, some of these rexinoids 
allow to determine which heterodimer is specifically involved in gene regulation 
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and hence these ligands provide a new tool for a better understanding of RXR 
action and the development of new pharmacological compounds. 

Because thiazolidinediones. which are PPARy specific ligands. are 
thought to exert part of their antidiabetic effect through an alteration of the fatty 
acid partitioning and activation of PPARy-RXR, the inventors proposed that 
RXR ligands could have similar effects and be potential activators of FATP. 
LPL. and ACS expression. LPL, FATP, and ACS are involved in fatty acid 
partitioning and the levels of circulating free fatty acids depend on the 
expression of these genes. PPARy activators improve glucose homeostasis 
and this effect may be due to the regulation of LPL (Schoonjans et al., 1996). 
FATP and ACS target genes. In order to confirm the implication of PPAR-RXR 
heterodimer in the improvement of glucose homeostasis, the regulation of LPL. 
FATP, and ACS genes by retinoic acid was studied. This allowed to determine 
which dimer was involved in this process and provided a better understanding 
of the regulation of lipid uptake by nuclear receptors. The PPAR-RXR 
heterodimer as the molecular target mediating these effects on fatty acid 
partitioning leading to an improvement of insulin sensitivity. 

1. Material and methods 

Abbreviations 

RA, retinoic acid; at-RA, all-frans RA; 9c-RA. 9-cis RA; RAR, Retinoic acid 
receptor; RXR. retinoid X receptor; RXR-RE. RXR response element; TTNPB. 
ethyl-p-[(E)-2-(5,6.7.8-tetrahydro-5,5,8,8-tetramethyl-2-naphtyl-1- 
propenyl]ben2oic acid; PPAR. peroxisome proliferator-activated receptor; 
DMSO. dimethyl sulfoxide. 

Materials 

at-RA was purchased from Sigma (St. Louis. Mo.) 9c-RA. TTNPB (Mangelsdorf 
et al., 1990) was obtained from ligand pharmaceuticals. 
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Cell culture and treatments 

FAO cells are a well-differentiated subclone derived from the rat 
hepatoma H4 IIEC3 line. They were cultured at ST^'C in a humidified 
5 atmosphere of 5% C02. 95% air in Ham F-12 medium (Gibco-BRL) containing 
10% fetal calf senjm, penicillin (200 lU/ml) and streptomycine (50 mg/ml) 
according to previously published procedures (Meunier-Dumort et aL. 1996), 
Culture medium was changed every 48 hr. Experiments were performed on 
subconfluent cells maintained in culture under serum free conditions. at-RA and 

10 9c-RA were dissolved in dimethyl sulfoxide (DMSO) while oleic acid was 
complexed with bovine serum albumine before addition to the cells (BSA), The 
human hepatoma cell line HepG2 was obtained from ECACC (Porton Down, 
Salisbury, United Kingdom) and the mouse preadipocyte cell line 3T3-L1 from 
(ATCC). These cells were maintained in Dulbecco's modified Eagle's minimal 

15 essential medium (DMEM), supplemented with 10% fetal calf semm (LPDS- 
DCC), L-glutamine and antibiotics, unless stated otherwise. 3T3-L1 cells were 
differentiated initially by a 2 day-treatment with dexamethasone (0.1 pM). 
isobutyl methyl xanthine (0.25mM) and insulin (0.4 pM). Subsequently, the cells 
were then maintained for an additional 8 days with insulin until complete 

20 differentiation. Experiments were perfomned on subconfluent cells maintained in 
culture under serum free conditions. at-RA and 9c-RA were dissolved in 
dimethyl sulfoxide (DMSO) while oleic acid was complexed with bovine serum 
albumine (BSA) before addition to the cells. 



RNA analysis 

RNA preparation, northern blot hybridizations, and quantification of total 
cellular RNA were performed as described previously (Auwerx et al.. 1988). A 
mouse FATP cDNA probe was obtained after cloning an RT-PCR fragment 
30 from mouse adipose tissue RNA (phmers ATG CGG GCT CCT GGA GCA GGA 
CAG CC and CTG CGT GTC AGG CAG GAT GCT CTC AGG CCC) into 



25 
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pBIuescript-KS. The insert was sequenced and found to be identical to the 
reported mouse FATP sequence. The rat ACS probe corresponds to the 
EcoRV restriction fragment of the rat ACS cDNA and an EcoRI fragment of the 
hLPL26 clone was used to measure LPLmRNA. p-actin was used as control 
probe. 

Isolation of nuclei and transcriptional rate assay 

Nuclei were prepared from FAO and 3T3-L1 differentiated cells and 
treated either with 9c-RA or vehicle. Transcription run-on assays were 
performed as described by Nevins (Nevins. 1987). Equivalent counts of nuclear 
RNA labeled with [a-^^P]UTP (3000 Ci/mmol) were hybridized for 36 hr at 65°C 
to 5 pg of FATP, ACS, GAPDH and vector DNA (pBluescript) immobilized on 
Hybond-C Extra filters (Amersham). After hybridization, filters were washed at 
room temperature for 10 min in O.SxSSC and 0.1% SDS and twice at 65X for 
30 min and subsequently exposed to X-ray film (BIOMAX-MS, Kodak). 
Quantitative analysis was perfomned by scanning densitometry (BioRad GS670 
densitometer). 

Preparation of albumin-bound fatty acids and Fatty acid uptake assay 

Radiolabeled ^"^[C] oleate fatty acid was added to water at 40*'C. Albumin 
(BSA; fraction V, fatty acid free. Sigma, St Louis, Mo) was then added from a 
concentrated stock (20 g/100 ml) to give a final molar ratio of 1/1 by gentle 
mixing. 2 X Hank's solution was added to obtain a 1 X final solution. Incubation 
was carried out at 37X for 45 minutes. 

The measurement of uptake of V'^[C]-labeIed oleate (about 50 
mCi/mmol, NEN. Boston, MA) was carried out in 24 or 6 well plates with 10^ 
cells / ml of medium. Before treatment, the cells were washed with 1X Hank's 
solution. BRL 49653 (100 - 250 nM) and fenofibric acid (100 - 250 pM) and the 
other retinoids (dose range from 10"^ M to 10*^ M) were added in fresh DMEM 
medium containing 10% FCS. After 48 h of treatment, cells were washed with 
Hank's solution and incubated for one additional hour in serum-free, glucose- 
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free medium. Cells were then washed once at 37**C and twice at 23 ''C with 1X 
Hanks solution containing BSA. Hank's solution without BSA was then added 
before the assay. A volume corresponding to 1pCi of ^"^[CJ-oieate albumin- 
bound solution was added in each well and cells were incubated for 1 min at 
5 room temperature. Incubation was stopped after 1 min with 3 washes of ice 
cold 1X Hank's solution without BSA. A complementary experiment has been 
performed to verify whether a specific cell surface binding of ^^[C]-oleate could 
interfere with the assay. For this second assay, the cells were washed under 
more stringent conditions in 1X Hank's solution containing 0.5% BSA. Cells 
10 were then lysed in 400 pi of 0.1% SDS solution. The lysate was counted for 5 
min with 4 ml of scintillation solution. Assay was performed on triplicate points. 



15 1.9C-RA induces FATP and ACS mRNA levels in the hepatoma cells FAQ and 
H g p Qg 

In order to determine whether FATP and ACS genes were regulated at 
transcriptional level by retinoic acid, a dose-response experiment was 
performed. FAO cells were cultured for 6 hr with increasing doses of 9c-RA 

20 (10 to 10"^ M) whereas control ceils were incubated with vehicle alone (0.1% 
v/v DMSO). incubation with 9c-RA resulted in a dose-dependent increase of 
both FATP and ACS mRNA levels in this hepatic ceil line (figure 10A). A 
maximal increase for FATP (>9-fold) and ACS (14-fold) was observed with a 
dose of 10'^M 9c RA. FATP and ACS m RNA are also upregulated by at-FlA but 

25 to a lesser extent that 9c-RA (data not shown). 

Induction of FATP and ACS mRNA levels in the human hepatoma cell 
line HepG2 was analyzed, 9c-RA induced FATP and ACS mRNA after 24 hr 
(dose 10'® M) (figure 10B). However, no effect of at-RA could be detected (data 
not shown). 



Results 



30 
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2. 9c-RA induces FATP and ACS mRNA levels in differentiated Caco2 and 
3T3-L1 ceils . 

Since adipose tissue is an important insulin-sensitive tissue, it was 
5 interesting to know whether retinoids also regulate FATP and ACS gene 
expression in mouse adipose cell line. 3T3-L1. 3T3-L1 cells were therefore 
completely differentiated and then treated with 9c-RA (10'® to 10"^ M). 9c-RA 
induces FATP and ACS mRNA expression in a dose-dependent manner. 
Maximum increase was observed at a concentration of 10*^ M 9c-RA (figure 
10 11 A). 

The expression of FATP and ACS induction was next compared between 
Fao hepatoma cells and the colon adenocarcinoma cells Caco2. Both FATP 
and ACS gene expressions are increased after 6 hr of treatment with 9c-RA at 
10*^ M. However, higher levels of induction are nevertheless observed in FAO 
15 cells (figure 1 1B). 

3. The induction of FATP and ACS oene ex pression bv 9c-RA is at the 
transcriptional level. 

Nuclear run-on analysis was next carried out to determine whether the 
20 induction of this expression by 9c-RA was a direct consequence of enhanced 
gene transcription. FAO cells were therefore treated for 2 hr with 9c-RA (10"^ M) 
after 18 hr of culture in serum-deprived medium, whereas 3T3-L1 ceils were 
treated for 24 hr with 9c-RA (10'® M) in normal medium without serum 
deprivation. FAO metabolism is greatly affected with lipids, serum deprived 
25 conditions allow to observe a strong effect which couldn't be detected in normal 
conditions. Controls were treated with vehicle only. Transcription rates for the 
FATP and ACS genes were induced 2-fold and 2.8-fold respectively in FAO 
ceils and 4- and 3-fold respectively in differentiated 3T3-L1 cells. In the retinoic 
acid treated cells, transcription of the GAPDH gene was not affected in neither 
30 of the two ceil lines. 
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4. Rexinoids. but not RAR agonists, induce FATP. ACS and LPL gene 
expression in differentiated 3T3-L1 cells . 

5 To determine whether the effect of retinoids on the regulation of FATP 

and ACS gene expression was mediated by RAR or RXR. 3T3-L1 differentiated 
cells were treated with 9c-RA, a panagonist of both RAR or RXR, or at-RA and 
TTNPB which are specific agonists of RAR. No activation of LPL. FATP and 
ACS gene expression was detectable after 24 hr of treatment with at-RA or 

10 TTNPB. In contrast, a strong dose-dependent induction of both LPL, FATP and 
ACS was observed 24 hr after treatment with 9c-RA, a RXR agonist. Maximum 
induction occurred at 10"^ M 9c-RA (4-fold and 3-fold for LPL. FATP and ACS 
respectively). Interestingly, no effect of non of the retinoids was observed on 
non-differentiated 3T3-L1 preadipocyte cells (figures 13. 14 and 15). 



RXR agonists, but not RAR agonists, induce oleate uptake in 
differentiated 3T3-L1 cells. 

To establish whether the induction of the FATP-1 and ACS was accompanied 
20 by a concommitant increase in fatty acid uptake into the cells, we measured 
^"^[C] oleate uptake in differentiated 3T3-L1 cells exposed to increasing doses of 
prototypic retinoid, i.e.. TTNPB. an RAR agonist (dose range 10'^ M to 10"^ M). 
No effect on ^'^[C] oleate uptake was observed after TTNPB confirming that this 
effect was specific for the rexinoids (figure 16). 



Discussion 

These results demonstrate that RXR ligands control both FATP and ACS 
gene expression in several cell types. In the liver. FATP and ACS have been 
30 shown to be strongly upregulated by PPARa activators (Schoonjans et a!., 
1995; Martin et aL. 1997). These distinct transcriptional effects are mediated by 



15 



25 
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PPARa interacting with responsive elements in the promoter regions of these 
genes. As PPARa effects are mediated through activation of the PPARa-RXR 
heterodimer, it was not surprising to observe a strong induction of FATP and 
ACS expression by specific RXR activators. Expression of FATP and ACS are 
co-induced by 9c-RA in the liver and in intestine. Free fatty acids generated by 
LPL are avidly taken up by these tissues and are converted in metabolic active 
acyl-CoA derivatives to sustain a high level of p-oxydation. The co-induction of 
FATP and ACS by 9c-FlA, which resembles the induction of these genes by 
fibrates, indicates that 9c-RA and fibrates have a similar metabolic effect on rat 
and human hepatoma cell line (Kliewer et aL). These data demonstrate that the 
RA pathway is implicated in the regulation of lipid metabolism and underline the 
powerful action of specific RXR agonists on the uptake and metabolism of long- 
chain fatty acids in the liver. 

In adipose tissue, a second insulin-sensitive tissue. FATP and ACS are 
also co-regulated by 9c-F^^ in a dose-dependent manner. PPARy is the 
predominant in this tissue and PPARy activators have been shown to induce 
both FATP and ACS expression. Hence, the induction of FATP and ACS 
expression by RXR agonists in the context of PPARy-RXR heterodimer 
suggests that RXR is also here an active heterodimer partner. 

The stimulatory effects of 9c-RA on FATP and ACS gene expression in 
both tissue implicate the RA pathway as an important regulation pathway in 
liver and adipose tissue where retinoids have a significant physiological role in 
fatty acid metabolism. Since FATP and ACS genes are implicated in fatty acid 
partitioning, it can be concluded that the distribution of fatty acids is affected in 
response to 9c-RA. This effect of rexinoids on fatty acid partitioning is different 
from the effects observed with PPAR activators and depends on the relative 
expression of the different FJXRs and PPARx (Braissant et a!., 1994; Lemberger 
et al. 1996; Auboeuf et al.. 1997). RXR is present in both liver and adipose 
tissue and this pattern of expression determines the tissue specific effects of 
RA activation. In contrast, activity of PPARa agonists is mostly limited to the 
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liver, the prime site of expression of PPARa. Furthermore, the fact that the co- 
regulation of LPL. FATP and ACS genes by 9c-RA. TTNPB and at-RA depend 
on the differentiated state of 3T3-L1 ceils points to the importance of PPArg in 
this process. In preadipocyte cells PPARy is expressed at low levels and its 
5 expression increases upon adipocyte differentiation. The absence of a retinoic 
acid response in differentiated 3T3-L1 cells expressing almost no PPARy 
suggest that PPARy is an obligatory partner of RXR in the retinoid-dependent 
regulation of LPL, FATP and ACS genes in adipose tissue. Hence, due to its 
more general expression, it is expected that the effects of rexinoids are more 

10 generalized than the effects of the more tissue-restricted PPARa (liver) and 
PPARy (adipose tissue) agonists. In fact, rexinoids effects on gene expression 
should resemble the effects of a combined PPARa and y agonist. This effect is 
direct on the FATP gene and suggests the presence of a response element like 
it has been demonstrated in ACS and LPL gene (Schoonjans et al., 1996) but 

15 also and recently in FABP gene (Poirier et al., 1997). 

It seems that rexinoids. the new term used to specify RXR selective 
ligands collaborate with PPAR ligands to control the expression of FATP'and 
ACS genes involved in fatty acid metabolism. The beneficial effect of PPARy 
activators on glucose homeostasis has been previously shown by us to be in 

20 part due to a redistribution of fatty acids towards adipose tissue with a relative 
depletion of fatty acids in the muscle (Martin et a!.. 1998). As known since 
Randle's work in the 60 (Randle et a!., 1963; Randle et al., 1964), depletion of 
muscle fatty acid content will result in an improvement of glucose homeostasis. 
Hence, it is tempting to speculate that the improvements of glucose 

25 homeostasis observed with rexinoids are similarly linked to an altered 
partitioning of fatty acids. In view of the distinct tissue distribution of RXR 
expression, induction of FATP and ACS by RXR ligands might not have the 
same tissue-specificity as that of PPARa and PPARy activators and could 
contribute to a different tissue partitioning of fatty acids. These agents provide 

30 hence an alternative way to regulate the expression of genes implicated in fatty 
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acid distribution among the different tissues. 

This study demonstrates that RXR ligands regulate the expression of 
LPL, FATP and ACS genes in several cell types. The retinoic acid pathway is 
an important signalling pathway for the regulation of genes which are implicated 
5 in fatty acid supply. Our results suggest that the PPAR-RXR complex is the 
molecular target by which rexinoids regulate FATP and ACS gene expression. 
Furthermore, they define FATP and ACS as new target genes in the RXR- 
dependent signalling pathway. The observation that both rexinoids and 
peroxisome proliferators stimulate the transcription of these genes implicated in 

10 lipid metabolism, suggest that rexinoids may act as an hypolipidemic and 
hypoglycemic agent through activation of PPAR-RXR complex and are 
consistent with the hypothesis that the heterodimer PPAR-RXR heterodimer is 
the molecular target for the improvement of insulin sensitivity. 
TTNPB, which is an activator of RAR in the context of RXR-RAR heterodimer, 

15 has no effect on FATP-1 and ACS. The combined results of the studies using 
these synthetic retinoids implicates the PPAR-RXR heterodimer as the 
molecular target of the regulation of FATP-1 and ACS by retinoic acids. Since 
PPAR and RXR are both active components of this heterodimer and since we 
have previously demonstrated that the FATP-1 and ACS genes are both 

20 regulated by PPAR activator (Martin, G. et al (1997) J. Biol. Chem. 272:28210- 
7). it appears that rexinoids could cooperate synergistically with PPAR ligands 
in the control of the expression of these two genes involved in fatty acid 
metabolism. Furthermore, the fact that the coregulation of FATP-1 and ACS 
genes by rexinoids depends on the differentiated state of 3T3-L1 cells points to 

25 the importance of PPARy in this process. In preadipocyte cells PPARy is 
expressed at low levels and its expression increases upon adipocyte 
differentiation. The absence of a rexinoid response is undifferentiated 2T3-L1 
cells, expressing almost no PPARy, hence also suggest that PPARy is an 
oligatory partner of RXR in the retinoid-dependent regulation of FATP-1 and 

30 ACS genes in adipose tissue. 
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Expression of FATP in human tissues 

5 In order to determine whether the expression of FATP was ubiquitous or limited 
to some tissues, hybridization experiments were performed with a human probe 
radioactively labeled on a commercial Northern Blot (Clontech) containing 
mRNA of 8 human tissues. These tissues are heart, brain, placenta, lung, liver, 
skeletal muscle, kidney, and pancreas tissues (figure 14). The expression is 

!0 very high in the skeletal muscle, high in heart and pancreas, medium in brain, 
weak in placenta, liver, and very weak in kidneys. The human mRNA 
corresponding to FATP has a size of 4.4 kb. The human p-actin probe has 
been used for the normalization and the top signal represents the ubiquitously 
expressed isoform, whereas skeletal and cardiac muscle express a specific p- 

15 actin isoform. 

The commercial Northern Blot did not contain adipose tissue probably due to 
difficulties in performing its extraction. Therefore, a supplementary Northern 
Blot analysis was performed on various human cell lines and adipose tissue. 
20 The expression of FATP was analyzed. FATP was very highly expressed in 
adipose tissue. To a weaker extent, FATP was expressed in hepatocyte and 
monocytic cell lines. The expression was detectable in chorionic carcinoma 
cells (Jeg3) and human colon cells (Caco2). 

25 The high level of FATP mRNA in the skeletal muscle has been checked on 
several skeletal muscular tissues and the level of expression of the FATP 
mRNA in this tissue has been compared with the level in sub-cutaneous or 
visceral adipose tissues, liver and colon tissues (figure 17). 

30 Interestingly, the expression level in the human muscle (lines 1. 2. and 3) has 
been confirmed at a level which is close to the level observed in adipose tissue. 
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which was not the case when the same experiments were perfomied on 
rodents. The expression of FATP is very high in the adipose tissue. In normal 
patients, the adipose tissue is sub-cutaneous. In patients suffering from obesity, 
the adipose tissue is visceral and sub-cutaneous. In these patients, the 
5 expression of FATP is higher at the level of the sub-cutaneous tissue. 

FATP is thus expressed ubiquitously in human tissues, with a predominant 
expression in adipose tissue, muscle and heart. 

10 As a transport protein, FATP would permit transportation of fatty acids in the 
adipose tissue in nutrition periods and would be involved in efflux of said fatty 
acids in fast periods. Furthermore based on the obsen/ation that FATP is highly 
expressed in skeletal muscle, it is concluded that FATP is very important as 
energy source for a tissue which is known to use mainly glucose in normal 

15 conditions. This result emphasizes a competition between glucose and long- 
chain fatty acids for the muscular activity. 

The third tissue which expressed a high level of FATP is heart. This expression 
is not influenced by nutrition. In brain tissue the level of FATP is significant. In 
20 addition to its activity as energy source, FATP could participate in the 
incorporation of lipids for synthesis of axial membranes and myelin. In the 
pancreas, FATP could be the specific transport protein of long-chain fatty acids 
messengers essential for the regulation of the production of insulin by the 
pancreas. 



It is especially noted that the level of FATP is especially high in tissues which 
are sensitive to insulin such as adipose tissues and muscle. 



25 



Western blot analysis of FATP. 

30 Cells and tissues were homogenized in a lysis buffer of PBS containing 1% 
Triton X-100 (Sigma. St Louis, MO). Tissues were homogenized with a polytron 
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in extraction buffer containing PBS and 1% NP-40. 0.5 % sodium deoxycholate. 
0.1% SDS (Sigma, St Louis. MO). In the presence of a fresh cocktail of 
protease inhibitors (ICN. France) (100 pg/ml AEBSF, 5 mg/mi EDTA, 1pg/ml 
leupeptin, 1 pg/ml pepstatin). Protein extracts were obtained by centrifugation 
of the lysate at 4*^0 and then concentration was measured with the Biorad DC 
Protein colorimetric assay system (Biorad. France). 

Protein (100 jjg) were separated on a 10% polyacryiamide gel according to 
Laemmli [Laemmli. 1970 #1541]. transferred to nitrocellulose membrane as 
described by Towbin [Towbin. 1979#1542] (Amersham, France), and blocked 
overnight in blocking buffer (20 mM Tris, 100 mM NaCI. 1% Tween-20. 10% 
skm milk). Filters were first incubated 4 hours at 21**C with rabbit IgG anti- 
mFATP (10 mg/ml) developed against a FATP peptide, corresponding to amio 
acids from human FATP-1. and next for 1 hour at 21**C with a goat anti-rabbit 
IgG (whole molecule) peroxidase conjugate (Pasteur Diagnostic Sanofi. 
France) diluted at 1/5000. The complex was visualized by chemiluminescence 
using with 4-chloro-1-naphtol as reagent according to the manufacturer's 
protocol (ECL. Amersham. France). 

Northern blot analysis of different human cell lines, such as NCL-H295. THP-1. 
Caco2. chorionic cells and hepatoma cells shows that hFATP-1 can be 
detected in ail these cells but to a relatively lower extent than in human adipose 
tissue (figure 18). hFATPI mRNA is relatively will expressed in Caco2 cells and 
in cells of hepatic origin. 
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CLAIMS 



15 



20 



1. Nucleotide sequence which comprises a sequence Involved in the 
expression of the human Fatty Acid Transport Protein (hFATP) comprising the 
aminoacid sequence of Figure 2 or Figure 5. 

2. Nucleotide sequence according to claim 1, which comprises a 
sequence encoding the human FATP. corresponding to or comprising the 
nucleotide sequence of Figure 1 or Figure 3. 

3. Nucleotide sequence according to Claim 2. which is the nucleotide 
sequence of Figure 1 or Figure 3. 

4. Nucleotide sequence according to Claim 2, which comprises an ORF 
sequence located between nucleotide 23 (ATG) and nucleotide 1963 (TGA) of 
Figure 1 or Figure 3. 

5. Nucleotide sequence hybridizing in high stringency conditions, with a 
probe comprising 50 to 300 nucleotides including at least 6, preferably at least 
9 nucleotides from the following sequence : 
CGGGGAGACGGGACGTGAAGGG. 

6. Nucleotide sequence according to anyone of claims 1 to 5 which is a 
genomic DNA. 

7. Nucleotide sequence according to anyone of Claims 1 to 6, which is 
the gene coding for the hFATP. 

8. Nucleotide sequence according to anyone of Claims 1 to 3 or 6 to 7, 
which comprises a transcription initiation site 61 bp upstream from the ATG 
codon. 

9. Nucleotide sequence according to anyone of Claims 6 to 8, which 
contains 12 exons and 1 1 introns having the following structure: 
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167 562 724 886 997 1206 1333 1471 1636 1783 



1 


2 


3 


4 
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6 
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9 


10 


11 


12 



ATG TGA 



10. Nucleotide sequence according to claim 9. which comprises the 
nucleotide sequence of Figure 4. 

11. Nucleotide sequence according to anyone of claims 1 to 10» which 
comprises sequences involved in the regulation of the expression of the gene 
coding for the human FATP, 

12. Nucleotide sequence according to anyone of Claims 1 to 11, which 
sequence either specifically hybridizes with one or both of the primers having 
one of the following sequences or is the amplification product obtained with the 
following sequences, or hybridizes in high stringency conditions with said 
amplification product : 

AAGGTCAATGAGGACACAATGG, 
CGAGTAGGTAGTGATCGTGCAG. 

13. Nucleotide sequence according to anyone of claims 1 to 12, which is 
a genomic sequence coding for FATP and which hybridizes in high stringency 
conditions with a probe containing around 200 bp and being preferably derived 
from the sequence of exon 2, said genomic nucleotide sequence comprising a 
sequence involved in the regulation of the gene coding for the FATP. 

14. Nucleotide sequence which is a mRNA as obtained by transcription 
of a genomic nucleotide sequence coding for the hFATP according to anyone 
of Claims 6 to 13. 

15. Nucleotide sequence according to anyone of Claims 1 to 13 which is 
the RNA sequence transcribed from the nucleotide sequence of figure 1 or 
figure 3 having 2 kb, or an RNA sequence including a nucleotide sequence of 
4.4 kb hybridizing in high stringency conditions with the sequence of figure 1 or 
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figure 3. 

16. cDNA as obtained by reverse transcription of the mRNA according to 
Claim 14 or 15. 

17. Polynucleotide which is selected among the following sequences: 

- AAGGTCAATGAGGACACAATGG 

- CGAGTAGGTAGTGATCGTGCAG 

- a sequence comprising or corresponding to sequences involved in the 
regulation of the expression of the gene encoding the hFATP, 

- any fragment derived from the nucleotide sequences disclosed in 
figures 1, 3 or 4, for instance by deletion mutation or insertion provided 
the essential biological properties of the native sequences are 
maintained. 

18. Vector comprising, inserted in a site not essential for its replication, a 
nucleotide sequence according to anyone of claims 1 to 16. 

19. Vector according to Claim 18, which further comprises a reporter 
sequence such as the CAT gene or the luciferase gene. 

20. Vector according to Claim 19, wherein the expression of the reporter 
gene is under the control of an heterologous regulator region. 

21. Recombinant cell which comprises a nucleotide sequence according 
to anyone of claims 1 to 16 or a vector according to anyone of Claims 18 to 20. 

22. Recombinant ceil according to Claim 21 which is a eucaryotic cell. 

23. Human Fatty Acid Transport Protein (hFATP) comprising the 
aminoacid sequence of figure 2 or figure 5. 

24. Human FATP according to Claim 23. which has an isoelectric point of 
8.5 and which replies to the aminoacid sequence of Figure 2 or Figure 5. 

25. Human FATP which is encoded by a nucleotide sequence according 
to anyone of Claims 1 to 16. 

26- Human FATP which is devoid from its signal peptide. 

27. Human FATP according to anyone of claims 23 to 26, which is 
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glycosylated. 

28. Antibodies specifically directed against hFATP according to anyone 
of Claims 23 to 27. 

29. Composition for the therapeutic modulation of the intracellular level of 
long-chain fatty acids, which comprises a component capable of regulating the 
expression of the gene encoding hFATP. 

30. Composition for the therapeutic modulation of the blood level of long- 
chain fatty acids, which comprises a component capable of regulating the 
expression of the gene encoding hFATP. 

31. Composition for the treatment of a pathological state associated with 
a deficient regulation of the intracellular level of long-chain fatty acids, which 
comprises a component capable of regulating the expression of the gene 
encoding hFATP. 

32. Composition according to claim 30 or 31. which comprises an 
inhibitor of the expression of the hFATP gene. 

33. Composition according to claim 30 or 31. which comprises a 
compound capable of enhancing the expression of the hFATP gene. 

34. Composition according to claim 33 for the treatment of 
cardiomyopathies or diabetes. 

35. Composition according to claim 32 for the treatment of obesity. 

36. Method for the screening of the effect of a determined compound, on 
the expression of the hFATP protein in determined cells, which method 
comprises the step of detecting the transcription of the mRNA in the cells or cell 
extracts, after contacting the cells with said determined compound, in 
conditions enabling the interaction between said compound and said cells. 

37. Method according to claim 36. for the screening of the effect of a 
determined compound on the regulation of the expression of the hFATP protein 
in cells, which comprises: 

a) measuring the level of transcription of the mRNA in cells or ceil 
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extracts, wherein the cells have previously been contacted with the deternnined 
compound, in conditions enabling the interaction of said cells and said 
determined compound; 

b) measuring the level of transcription of the mRNA in the same cells 
species as in step a) or on extracts of these cells, wherein these cells have not 
been previously contacted with the assayed compound; 

c) comparing the level of transcription obtained in steps a) and b). 

38. Method according to claim 36 or 37. wherein the cells used are 
selected among the group of liver cells, heart ceils, adipose tissue cells and 
skeletal muscle cells. 
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CGGGG.— -\^ACoooA\-GXGAAGGGAXGCG-3GCr - CCGGGXGCG 

ATGCG 

1 



41 

GGCGCGGCCTCGGTGGTCTCGCTGGCGCTGTTGTGGCTGC 
GGCGCGGCCTCGGTGGTCTCGCTGGCGCTGTTGTGGCTGC 

41 

81 

TGGGGCTGCCGTGGACCTGGAGCGCGGCAGCGGCGCTCGG 
TGGGGCTGCCGTGGACCTGGAGCGCGGCAGCGGCGCTCGG 
81 

121 

CGTGTACGTGGGCAGCGGCGGCTGGCGCTTCCTGCGCATC 
CGTGTACGTGGGCAGCGGCGGCTGGCGCTTCCTGCGCATC 
121 

161 

GTCTGCAAGACCGCGAGGCGAGACCTCTTCGGTCTCTCTG 

GTCTGCAAGACCGCGAGGCGAGACCTCTT 

161 

201 

TGCTGATCCGCGTGCGCCTGGAGCTGCGGCGGCACCAGCG 

189 
241 

TGCCGGCCACACCATCCCGCGCATCTTTCAGGCGGTAGTG 

GCGGTAGTG 

189 

281 

CAGCGACAGCCCGAGCGCCTGGCGCTGGTGGATGCCGGGA 
CAGCGACAGCCCGAGCGCCTGGCGCTGGTGGATGCCGGGA 
199 

321 * 

CCGGCGAGTGCTGGACCTTTGCGCAGCTGGACGCCTACTC 
CCGGCGAGTGCTGGACCTTTGCGCAGCTGGACGCCTACTC 
239 

361 

CAATGC3GTAGCCAACCTCTTCCGCCAGCTGGGCTTCGCG 

CAATGGGGTAGCCAACCTCTTCCGCCAGCTGGGCTTCGCG 

279 

401 

CCGGGC3ACG' - GG X GGC CATCTTCCTGGAGGGC 
CCGGvj^- — A\— w 1. i. GGCCATCTTCGTGG-nvjGGCCGGCCGG 
319 

441 

AGT - — — 7 — w-^\»T3 - ^^Q'Z'3^^C2^'Z^'^ 3 CAAG GCGGGCAT 
AGT - — — — wC-Gi'^wv- - GGG C CI* 33 3 3AA 3 333 3G CAT 
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Fia 1B 

481 

GGAGGCCGCGCTGCr"^:\CGTGAACCTGCGGCGCGAGCCC 
GGAGGCCGCGCTGCTGAACGTGAACCTGCGGCGCGAGCCG 
399 

521 

CTGGCCTTCTGCCTGGGCACCTCGGGCGCTAAGGCCCTGA 

CTGGCCTTCTGCCTGGGCACCTCGGGCGCTAAGGCCCTGA 
439 

561 

TCTTTGGAGGAGAAATGGTGGCGGCGGTGGCCGAAGTGAG 

TCTTTGGAGGAGAAATGGTGGCGGCGGTGGCCGAAGTGAG 
479 
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hFATP' 
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hFATP' 



hFATPl 
hFATP ' 



-^UTR 
'UTR 



-^UTR 
+ UTR 



+ QTR 
+UTR 



601 



CGGGCATCTGGGGAAAAGTTTGATCAAGTTCTGCTCTGGA hFATPl +utr 



641 



GACTTGGGGCCCGAGGGCATCTTGCCGGACACCCACCTCC hFATPl -f-UTR 



681 

TGGACCCGCTGCTGAAGGAGGCCTCTACTGCCCCCTTGGC 
721 

ACAGATCCCCAGCAAGGGCATGGACGATCGTCTTTTCTAC 



hFATPl +UTR 
hFATPl +UTR 



761 



ATCTACACGTCGGGGACCACCGGGCTGCCCAAGGCTGCCA hFATPl i-UTR 



801 

TTGTCGTGCACAGCAGGTACTACCGCATGGCAGCCTTCGG 



hFATPl T-UTR 



841 

CCACCACGCCTACCGC 



AGGCGGCTGACGTGCTCTAT 



hFATPl -UTR 



881 

GACTGCCTGCCCCTGTACCACTCGGGAGGAAACATCATCG 



hFATPl -^UTR 



921 

GCGTGGGGCAGTGT' 



rATGGGCTGACAGTGG" 



hFATPl ^UTR 



961 

CCGCAAGAAATTCrrGGrrAGCCGCTrCTGGGACGACTGT 
1001 

ATCA-^GTACAACTGrAC^GTGGTTCAGTACATCGGGGAGA 



hFATFl -UTR 



1 FAT PI -JTR 
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FIG. 1C 

1081 

GAGGC3ACACCGCGTGCGCCTGGCGGTGGGGAACGGGCTG 



GGT'Gw * G AT CT*GG\jAGG AG T"!* GAG GG AG CGGT"TGGG CG 
1161 

TACGCCAAATGGGGGAGTTCTACGGCGCGACCGAGTGCAA 
1201 

CTGCAGCATTGGCAACATGGACGGGAAGGTCGGGTCCTGT 
1241 

GGTTTGAACAGGCGCATCCTGCCCGACGTGTACCCCATCC 
1281 

GGCTGGTGAAGGTCAATGAGGACAGAATGGAGGTGCTGCG 
1321 

GGATGCCCAGGGCCTCTGCATCCCCTGCCAGGCCGGGGAG 
1361 

CCTGGCCTCGTTGTGGGTCAGATCAACCAACAGGACCCGC 
1401 

TGCGCGGCTTCGATGGCTATGTCAGCGAGAGCGCCACCAG 
1441 

GAAGAAGATCGGCCACAGCGTCTTGAGCAAGGGCGACAGC 
1-181 

GGCTACCTCTCAGGTGACGTGCTAGTGATGGATGAGCTGG 
1521 

GCTACATGTACTTCGGGGACCGTAGCGGGGACACCTTCCG 
1561 

CTGGCGAGGGGAGAACGTCTCCAACACCGAGGTGGAGGGC 
1601 

oTGC — :Av-jGCGCCTGCTGGGCCAGACAGACGTGGCGGTCT 

164 1 

.~\ i -w-«-r*^-x^ _ ^■^Vw w J. ^ v_ \_AvjGAG i ^-^AvjTv-:-^ _ AAGGCAGGGAT 
15 31 

— . ; - \_ 3CAGACC CCCACAGC ZTG GTGGACGCCAAC 
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FIG. 1D 

1761 

CCCGGCCCAT> 



GCCCCAGGTGGACAC hFATPl -UTR 



1801 

CACAGGCACC7TCAAGATCCAGAAGACGAGGCTGCAGCGA 



hFATFl ^UTR 



1841 

GAGGGCTTTGACCCACGCCAGACCTCAGACCGGCTCTTCT 



hFATPl -^UTR 



1881 

TCCTGGACCTGAAGCAGGGCCACTACCTGCCCTTAAATGA 



hFATPl +UTR 



1921 

GGCAGTCTACACTCGCATCTGCTCGGGCGCCTTCGCCCTC 



hFATPl +UTR 



1961 

TGAAGCTGTTCCTCTACTGGCCACAAACTCTGGGCGTGGT hFATFl ^UTR 

2001 

GGGAGAGGCCAGCTTGAGCCAGACAGCGCTGCCCAGGGGT hFATPl +UTR 



2041 

GGCCGCCTAGTACACACCCACCTGGCCGAGCTGTACCTGG 



hFATPl +UTR 



2081 

CACGGCCCATCCTGGACTGAGAAACTGGAACCTCAGAGGA 



hFATPl -rUTR 



2121 

ACCCGTGCCTCTCTGCTGCCTTGGTGCCCCTGTGTCTGCC 



hFATPl 4-UTR 



2161 

TCCTCTCCCTGCTTTTCAGCCTNTGTCTCCTTCCATCCNT 



hFATFl ^UTR 



2201 

GTCCCTGTNTGGCCTTAACCCG 



hFATPl -rUTR 
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M R A P G A G lAj A S V |V| S L A L L W 
A3G 023 GCT 003 QST QGG G3C tHi GCC 'ITG G3j GTu TO:^ CIG OuG CE TIG TG3 

Mr. APGAGTASVASLALLW 
A3G CG3 GZZ CCT 03% OA GS^i G::C "TZT GGC: TCA dG rm CIE CTT TGG 

M R PGAGTASVASL [gJ L L W 

AIG CE3 JCT OOG QGA GOV GCC TZT GIG OJZ 1CA TIG G33 CIG CTT ICG 



L 
L 
L 



61 



GLPWTWSAAAA GHCi] V Y V G S G G 
OGG CTG OCG T3G ^a: TOG X3C G33 GCA OGG GCG CICGSZ GIG TFC GIG G3::.?a: 033 GQC 
GLPWTWSAAAAFCVYVGGGG 

cxA err OOG iog ^oc tog jcc ccc gzg an ccn tiu iul gig tpc gig ogt ox gx: oj: 

GLPWTWSAAAAF [gJ V Y V G S G G 
OGA err CCG tog KI: tog A3C GCG OTG CIG GOG TICaGrGIG TTC G3G OJTPCC GGT (IE 



hFATPl 
mPATP 
rFATP 

hFATPl 
mFATP 
rFATP 



121 



WRFLRIVCKTARRDLFGLSV 
TOG a3C TIC CIG OaG A3C GTC TOC JAG at: GOG MG 03^ <3C etc hFATPl 

WRFLRIVCKTARRDLFGLSV 
TOG GQC TIT CIG CCT A3C GTC TOG ^AG KG OOG PCC OGA CIC TZT OGT CIC TCT GTT TuFATP 

WRFLRIVCKTARRDLFGLSV 
TOG CSV TTT CIG OGT MC GTC TX MG XG GCG PGG CSV CIC CIC TIT GGT CIC ICT GET rFATP 

181 LIRVRLELRRH fo] RAG [n] TIP [T] 
CIG JdC OX GTO a3C CIG CIG fGO 033 CJC CT3CGT QDC OX: fec AOCAIC OOG TS: 

LIRVRLELRRHRRAGDTIPC 
CIG JCrr GGT GTr G3G CJPl dC GGA GGV C?C OGGGGA O:^ OJV GKT AOGAX: CE 

LI RVRLE LRRHRRAG DTIPfR 

CIG ot: coc gig a3G cm. g?c cm osa an ac oaGOGA gd^ osv gc aogaoc oga 

241 I F Q a V [v]S3 RQPERLALVDA [011^ 
ATO TIT CTG QCG GIA GIG CM CIA CAG dC CTG GGCCTO GOG C3G GTE GOTC3CC QGG Ai: 

IFQAVARRQPERLALVDASS 
AIC TIC CTG C3Cr CTC 001 rm nm CAA OCA G?G 032 CIG OCA CIG CJB3 C3CGCC JGT AST 

T F Q A V A [qJ RQPER LALVDASS 
AIC TIC CTG C3Cr: C3G CTC OTJ CAG OJ; CTG 03!: CIG ODG OE GIA GKT GOG PGT 




301 G IeJ C W T F 
GGG G53 TX TOG KC TIT 

G I C W T F 
OGP /OA TOG TOG AX TIC 

G I C W T F 
GGT AIC TX TOG AX TIC 



361 



A 0 L D 

axcJGCTGac 

A Q L D 
OCA CTG CTG GC 

A Q L D 
CCA CTG CTAOC 

P G D V 
CCG C3X G\C CJG 

P G D V 
CCA OX CAT GIG 

P G D V 
COV ox C?iT GIG 



YSNAVANLF 
ISCTOC -?ftT QCG GIA GCCAAT CIC TEG 
TYSNAVANLF 
AX TTCTX JftT (XT GIA QOTAA: CTO TSG 

TYSNAVANLF 
AX 17CTOC JAT OCT C?IG GXAAT CHG TIC 

V A (T] F L E G R P E 
GIG OX Sic TIC C3G GAS C3XCX CEG C?G 
VAVF.LEGRPE 
GIG GCrGTO TEC C3G G?G OX GX OCG GG 
VAVFLEGRPE 
GIG GCTGTO TTC OG CPA C2X CX CCC C?G 



421 



481 



R Q L G F A 
CGC CPG C3G GGC TTC C3X 
R Q L G F A 
pOC CPG CTG GQC TIT OCA 

[lJ q l g f a 

CTC CPG CriG C33G TTr OX 

FVGLWLG LAKAG H[e] A A L L N V 
TIC GTO OX CTO TOG CHG OX CX XC ;V=G GX GGCATO CPG OX OX CIG CIC A=C GIG 

FVGLWLG LAKAGVVAALLNV 
TIC GIG CIA C:TO TDG CIG OX CX XC A=G GXGGrGnGGIG OCT GCTCITCX .^AT GET 

FVGLWLG LAKAGV VAALLNV 
TIC GX CIA CTO TOG CX OX CX XC A^G GX GGT GIA GTG OCT OX CTTCX .^^T GX 
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A<=C CIG CTG OCX GG GX CTO GC 
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APC GG AX COG C?G OX CTG CTC TTC ' 
N L R R E P L A F 

.-^ CX AX CCG G?G CX CTT GX T?C ' 

54 1 ^ S ^ 

TTT Gj-i CjLA GAA AIG ^^^^ t f i ; GGG GTG ! 

^ G M A A •.• 

— ~— CjGG GoG gag ATG GCA CCG Cdl i_L*3 i 



L G T S iGj A f; ALT 
CX CrXACC TOG OX OCT AA3GX CTO AIC hFATPl 

L G T S A A K A L I 
CTG OX AGA TCA XT GX AA3GX CX ATT mFATP 



L. G 
X TTG dX; 



S A 

TCA CCr C3X GX XC ATT rFATP 

S [g] [h] L G K £ L 

,XGX CAT CXGXA^. .^JT TX hFATPl 

S E: Q L G 3 L 

CC CA3 C?G CTG GX .^X XT mFATP 



AIG I 



x: rFATP 
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SOliT] K 7 
L K r 

cic ?x znz 

L K 7 



CSGDLG ? E I GJ I L P D 
CSGDLGPESILPD 

'T^c 'zrr a?i cig g3G ozr g?g ax: crc ccr 

CSGDLG PEs|v|l?D 



~ TCT 03^ GnT C3G G3G CCT C?G A^T CT 



OCT GC 



661 D pITJlHe AQlTrXlp L A Q fll P 
G?CCCGCI3 CIG A^G=G OT TCTArroI: Or TIG GC?i C?G A3C OOC 
DPMLAEAPTTPLAQAP 

G?ca3:AaG err err cpg cog ooc a:c aj^ ooc cig gca. caa or 03^ 

DPMLAEAPTTPLAQAP 

GRcar AOG ciTGrrcsG ens or: a:c a3v ccccigggactg or cnv 



721 D D R L F Y 
GKT CGT err TIC uc 
D D R L F Y 
GOT GH* CCD CIG TTT TTC: 

D D R L r Y 
GOT COT GCG CTA TTT T?C 

781 VV HSRYY r[m]a AFGH H [T] Y 

Gic GIG c?c ux: ?a3'uc TTCcn: ATG OA ocr: Ticoar fx: m^m: 

VVHSRYYRIAAFGHHSYS 
GIG GIG C?C j^GT A3G TPC TWr CCC AIT GZT OZ: TIT 033 CTtf: CRT TtX: 

VVHSRYYRIAAFGHHSYS 
GIGGIG C?C ;=flC A33 TPC COT AFC OA CTC TICa33 CPC GCT TOT TTCAG^ 

841 AA DVLYD CL P LYHSA GNI fT] G 
GOG err GPC GIG OC TOT lOr CTG (JJZ CnC TTiCCfiC XG CEA OSV A^A3c!raC OC 
AADVLYDCLPLYHSAGNIMG 
CrrpGCCPCrGrGCrClOT GRCTOCCTGCCA CnCXACCTiC XT OA 003 A^^TAOC AIG QST 

A [nJ dvlydcl plyhsagnimg 
cxn 7x G?ir GIG CIC w to: cta ccr cic tac cac tea oa cog a;^ aoc ots cnr 




901 



VGQcfTllYGLTVVLRKKFSAS 
C?IG Q3G TGT OC AIC TOT QCE CTG lO^ GIC GICCIC CD3 A?G AMTICT33 OJO AT 

VGQCVIYGLTVVLRKKFSAS 
GIG 033 CA3 IOC GIT A3C IPC Q3G TTG AE G3G GTIACIG CD3 AA3 TXClCi: Q3C AX: 

VGQCmiYGLTVVLRKKFSAS 
GIG GGR C?G TGT TOt: Arc IPC 033 TTA A3G GTG GIACTC 033 PfC AA5 TIClCi: Q3C AC 



TAG A?C 



961 R F V; D D C III K 
OGC TIC TGG G=C GCIGT ^AA3 

R F W D D C V K Y 
OOC TIC To3 G^ IGF GTC AA3 1AC 

R F W D D C V K Y 
GGC TIC 7^3 G?C C?C IGT GIC AM TAT 



N C T 

itr 

N C T 

TCC 
N C T 

to: 



V V Q 
GIG GTT CTG 

V V Q 
GTG OG 

V V Q 
CHA GIG CA3 

1021 CRy:^^[^QpvR [e]\a} e [r1 R 
cr: car T?c cig gg a^; cag ctg geg ox: c^g gcgg=g ^cca 

CRVLLRQPVRDVEQR 
or CGT T?C CT3 CIG A33 G\G C333 CTT CCC G?C GIG C;::! CEA 
CRVLLRQPVRDVE [rJ R 
OCCG3 CTG CA A3G C?G OOG GTT 033 C?T GIA C;=G fe^ CQ3 



Y I G E I 
T7C AICG33 AIC 

Y I G E I 
TCAOA OGT CAA AIC 

Y I G E I 
TAT AICOGT CPA AIA 

H R V R L 

cx:ci3CGriG cdc og 

H R V R L 

C7C a33crK3 cnr gg 

H R V R L 

ca: ctcge; cnr cc 



hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 

rFATP 

hFATPl 
mFATP 
rFATP 



1081 A 
G33 



V G 
GIG GGG . 



M G 



A V 
OOC GIG : 



C3CC 

ii4im 



oar 



CA-. A-rc 
C : 

CA3 .-33 



L R P A I W E 
CIG CGT err GTC AIT TG3 

L R P A I W 
CZ; G33 OCA GCC AIC TG3 G=G 

L R P A : W £ 
033 G33 CXA CTC TG3 

F I- G A T £ 
TTT T?^ 033 GTC KT G^3 



' C333 G3T PCZ CP3 
GATE 



F T [e] R F G V 

rrr AU GA5 OCT TIC GCC OA hFATPl 

--FTQRFGV 

1TC A33 CAT pC2 TIC CGT G33 mFATP 

r T Q |gJ F G V 
~ C^G&TTIC GJr GE rFATP 



c 



I A N M D 

ATP OOC aa: aig gc 

I A N M D 
ATT G33AAr ATG C^C 

^ ^ 

ATT 333 APC ATG CAT 



hFATPl 
mFATP 
rFATP 
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:201G K V G S C G 
rnr PrG crS2 03Z TDC TCT G!?r T 

G K V G S C G 
G3CPrG GIC C3QC XC ICC GGC T 

G K V G S C G 
QQC PrG G3C G3r "EC TOT GST T 

L261L V -K V N E D ' 
CIG CSIG A;G GIC «ff G?G G?C 

L V K V N E D • 
CIG GTC M3 GIC OT* GG A 

L V K V N E D ' 
CIG GTC APG GIC ffC GG A 



r N S R I L |Pj H V V ? I R 

Tr;^c AT oi: A3CCIG mc oc CTE i?ccGC ot: cog hFATTI 

FNSRILTHVYPIR 

:c =Ac ^o: OCT aic cic ckt gig m: cr: ;3C cnr mFATP 

F N S .R I L T H V V p T r 

;g: cnr aic cic otr ge czz ptc ggt rFATP 



M E LiJ L R D tAJ 
. ATGGG CiG CIGOE ST < 
M ■ E P L R D S 

; ATGQC OCA ciG cm GCBr , 

M E P L R D S 

; ;?rG gg cigmg gfc inc 



G L C I 

G3Z cz: "ZDC m: hFATPl 

E G L C I 

2G CjGC ct:: id:: m: mFATP 

Q| G L C I 

^ri ccc dc TTT* ffTP rFATP 



1321P CQAGEPGLLVGQINQQDPL 
CDC TDC C7G QGC GG3 C?G CTT OX CTC CTT CT3 QGTCJG OTi: C7A C^GC CGG C3D 

pcqpgepgllvgqinqqdpl 
ocG IOC CG car oas cec oj: ctt dc gbs cnrcsG Arc Mr ctg c?gg?c cur cig 

PCQPGEPGLLVGQINQQDPL 
GGG rSOC CPG OOC 033 G?A GZT aJ2 CTT dC GIC QOCCM ATC MT C7G CAAOC QCT OC 

i381R R F D G Y V S [e] SAT [sl K K I A K S V 
a3C CQC T32 GOT CDr TOT GTC PGC GAG AIT Oi: AX5SC iffi AOC GCC CAT A3C GIC 

RRFDGYVSDSATNKK lAHSV 
033 CGT T3r GOT 031 T?T CTT ^GT G\C AST GCC AOC A^C «G ATT CGr CTC A3C GTT 

RRFDGYVSDSATNKK lAHSV 
003 GGC TIC GOT OX TOT GTT PGT GVC AX GCi: PCCMC IfG ATT GX CAT AX GIG 

1441F [s] KGD SA YL SGpDVLVMDE-J:.G 
TICiGC: AA5 G3C GKTi^i: OX IJC GTC XA GST GSCCsIG Cm GEG AOE GOTC^G 5g OX 

FRKGDSAYLS G^D V L V M D E "-t G 
TIC 031 AA3 CJX CST AX OX 17C CTC T3CA OoT C7CGIG CIA GB A3E G^CSG CIG OX 

FRKGDSAYL SGDVLVMDELG 
TIC CCA AA3 GGG CN2 AX OX *I?C CTT TCA OST G7CGIG CIA GIG A3G GACGPG CIG OG 



E N 
GMAPC 
E N 

gagaa: 

E N 
G?GA?C 



V s 
cue IDC 

V s 
GIG TX 

V S 
OA IX 



1501Y MYFRDRSGDTFRWRG 
UC ATG IJC TIC 033 G?C CGT AX GX; QC AX TIC CO? TOG GA OX 
YMYFRDRSGDTFRWRG 
TAC ATG TAT TIC CGT GAT OX J^X CGG OC TICOOC IDG COT OX 
YMYFRDRSGDTFRWRG 
AIG I7C TIC CGT GC OX AX GGG GFCT AX TICOGA TOG OX OX 

156 jwH T E V E [gI VLS RLLGQT DVAVY 
OT: AX GG C?IG CTG nX GIG CIG A3C OX CIG CIG OX CPG AA GC GIG CJX GIC TAT 

TTEVEAVLS RLLGQTDVAVY 
AX AX GG GIG QW CJX C3G CIG ?GC CX CIA CIG OX CFG AX G=C GIGGCT GIG TAT 

TTEVEAVLSRLLGQTDVAVY 
PCC AX GG GIG G^ OX GIG CE AXT OX CIG TIG OX OG AX GPC GIG OCT GIG W 



1621G VAVPGVEG 
CGG GIG OCT GIT CCA C3GA GIG C?G XT 

GVAVPGV EG 
033 GIG GCT GIG CEA (XTx GIG CPG GGG 

gvavpgveg 
a:3\cnG gct gig oia oTi gig gg ggg 

1681 S rn L D P N [a]|T] Y 

AGG ctgtig c?c ox aat gx a:ia T-.C 

S Q L D P N S K V 
pX CrG TX GPC CCT AAT TZ^ AX T^^.C 
I N I Q L D P N S i-: 
^.-i. GG CX G?C OCT A2C TCi^. .-X " C 

174 1 R P I F L R L L ? 

CX OX AX TIC CIG OX XT rx XC 

F. P T F L P. ^ L ? 



K A G M A A IvJ A D P H 
GCAQQGAIGaX GX blCGCAC?craX CTC 

KAGMAA lADPH 
AW OCT OX AIG OCA GX AIC CTAGrT GX C?C 

KSGMAA lADPH 
AVV AX QX AIG OX GX /VTT OIA G?C OX CPC 



Q E L Q K V L 
CPG GrG CTG C?G GG CTG 

g E L Q K V L 
GG G?A TIAC=G GIT CTT C 

0 E L Q K V L 
■C?G G=A TIG C:^ A=G Grr CTT ; 

n V D T T G 7 

Vjau Lx-^k* .'T.^-rt VjUk- -^' ^^ 

^ V D ^T^ T G - 
CTCj CnG G?iX A3C PCA Q ~-^ZZ 
•: D T T G 

C!-r-. GTTG GnT AZA rCA GGT AlUr 



Y A 
X=\T OX 
S Y A 
TX 1?T OCT 

£ Y A 
TX -T^T GT 



hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 

hFATPl 
mFATP 
rFATP 



:a^g AIC CA3 hFATPl 
:ah3 ?cx: og mFATP 
^ c& rFATP 
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1801 KTRLQREGFD 
AftG A3G CIG OG OCA GG OH ITT C3C 

KTRLQREGFD 
AAG AX: CH} CIG OG OGT GV^ OCC HT OC 

KTRLQREGFD 
AflG AT 03^ CIA OG OSr OA Oar ITT QC 

1861 L D L K Q G [h] Y 

CPS c?G cic Aft3 CFG 03: TTC to: 

LDLKQGRY 
CIA C?G CIG OG Ga OZT TRT 

LDLKQGRY 
cm G?G CIG AAA OG Q3\ GXT T?C 



P R 0 T 
OA OCT QG /OC 

P R Q T 
GCEGCrrcTG AX 

P R 0 T 

or ca-oc at: 




TIArarG?G 
L D E R 

CEOVTqG Alft 
L D E R 

C3GCATC?G;G^ 



192lfsl G H F fXl L . 

TOGQarGocTictxrcE: "Z?^ hFATPl 

A G D F S L . 
GGA G3C G?C TIC XA CIC *E3V mFATF 

A G D F S L . 
CCA 03Z GC TIC TCA CET XA rPATP 



S D R L F F . ^ 
TCA G?C CGG CK: TIC TIC ^FATPI 

•EA ac AQG Cir TIC TIT mFATP 

S D R L F F ^r^AT^o 
TEA G?C 033 CKT TTC TIT ' 

V [x1[t1 r I c 

GE WOT'ar Arc *zr hFATPl 

V H A R I c 

gk: GATcrrrocr: AIT T^r mFATP 

V H A R I C 

CIC CSr GOT CDC Arc XT l^f^ 



FIG. 
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hsFATPI cDNA 



Start Codon 
CGGGG AG ACG GG ACGTGAAG GG ATG CGGGC 
ACTGGCGCTG CTTTGGTTTC TGGGACTTCC 
TGTGTACGTG GGTGGCGGCG GCTGGCGCTT 
AGACCTCTTT GGCCTCTCTG TTCTGATTCG 
AGCAGGAGAC ACGATCCCGT GCATCTTCCA 
GGCACTGGTG GACGCCAGTA GTGGTATATG 
CAATGCTGTA GCCAACCTGT TCCGCCAGCT 
GTTCCTGGAG GGCCGGCCGG AGTTCGTGGG 
GGTGGCTGCT CTTCTCAATG TCAACCTGAG 
ATCAGCTGCC AAGGCCCTCA TTTATGGCGG 
CGAGCAGCTG GGGAAGAGCC TCCTCAAGTT 
CCTGCCTGAC ACGCAGCTCC TGGACCCCAT 
ACAAGCCCCA GGCAAGGGCA TGGATGATCG 
CGGGCTTCCT AAGGCTGCCA TTGTGGTGCA 
CCACCATTCC TACAGCATGC GTGCCGCCGA 
CTCTGCAGGG AACATCATGG GTGTGGGGCA 
GCGCAAGAAG TTCTCCGCCA GCCGCTTCTG 
AGTGCAGTAC ATAGGTGAAA TCTGCCGCTA 
GCAGCGACAC CGCGTGCGCC TGGCCGTGGG 
GTTCACGCAG CGCTTCGGTG TGCCACAGAT 
CTGCAGCATT GCCAACATGG ACGGCAAGGT 
CACGCATGTG TACCCCATCC GTCTGGTCAA 
GGACTCCGAG GGCCTCTGCA TCCCGTGCCA 
GATCAACCAG CAGGACCCTC TGCGGCGTTT 
CAAGAAGATT GCCCACAGCG TTTTCCGAAA 
GCTAGTGATG GACGAGCTGG GCTACATGTA 
CTGGCGCGGG GAGAACGTGT CGACCACGGA 
CCAGACGGAC GTGGCTGTGT ATGGGGTGGC 
GGCAGCCATC GCAGATCCCC ACAGCCAGTT 
GAAGGTTCTT GCATCCTATG CTCGGCCCAT 
CACAGGCACC TTCAAGATCC AGAAGACCCG 
GACCTCAGAC AGGCTCTTCT TTCTAGACCT 
GAGAGTCCAT GCCCGCATTT GTGCAGGCGA 
CCACAAACTC TGGGCGTGGT GGGAGAGGCC 
GGCCGCCTAG TACACACCCA CCTGGCCGAG 
GAAACTGGAA CCTCAGAGGA -CCCGIGCCT 
TCCTCTCCCT GCTTTTCAGC rTMTGTCTCC 
CG 



TCCTGGAGCA GGAACAGCCT CTGTGGCCTC 60 

GTGGACCTGG AGCGCGGCGG CGGCGTTCTG 120 

TCTGCGTATC GTCTGCAAGA CGGCGAGGCG 18 0 

TGTTCGGCTA GAGCTGCGAC GACACCGGCG 24 0 

GGCTGTGGCC CGGCGACAAC CAGAGCGCCT 3 00 

CTGGACCTTC GCACAGCTGG ACACCTACTC 3 60 

GGGCTTTGCA CCAGGCGATG TGGTGGCTGT 420 

ACTGTGGCTG GGCCTGGCCA AGGCCGGTGT 480 

GCGGGAGCCC CTGGCCTTCT GCCTGGGCAC 54 0 

GGAGATGGCA GCGGCGGTGG CGGAGGTGAG 600 

CTGCTCTGGA GATCTGGGGC CTGAGAGCAT 660 

GCTTGCTGAG GCGCCCACCA CACCCCTGGC 720 

GCTGTTTTAC ATCTATACTT CTGGGACCAC 780 

CAGCAGGTAC TACCGCATTG CTGCCTTTGG 84 0 

TGTGCTCTAT GACTGCCTGC CACTCTACCA 900 

GTGCGTCATC TACGGGTTGA CGGTGGTACT 960 

GGATGACTGT GTCAAGTACA ATTGCACGGT 102 0 

CCTGCTGAGG CAGCCGGTTC GCGACGTGGA 1080 

TAATGGGCTG CGGCCAGCCA TCTGGGAGGA 114 0 

CGGCGAGTTC TACGGCGCTA CCGAGTGCAA 1200 

CGGCTCCTGC GGCTTCAACA GCCGTATCCT 12 60 

GGTCAATGAG GACACGATGG AGCCACTGCG 13 2 0 

GCCCGGGGAA CCCGGCCTTC TCGTGGGCCA 13 80 

CGATGGTTAT GTTAGTGACA GTGCCACCAA 144 0 

GGGCGATAGC GCCTACCTCT CAGGTGACGT 150 0 

TTTCCGTGAC CGCAGCGGGG ACACCTTCCG 156 0 

GGTGGAAGCC GTGCTGAGCC GCCTACTGGG 162 0 

TGTGCCAGGA GTGGAGGGGA AAGCTGGCAT 16 80 

GGACCCTAAC TCAATGTACC AGGAATTACA 174 0 

CTTCCTGCGT CTTCTGCCCC AGGTGGATAC 18 00 

GCTGCAGCGT GAAGGCTTTG ACCCCCGTCA 18 60 

GAAGCAGGGA CGCTATGTAC CCCTGGATGA 192 0 

CTTCTCACTC TGAAGCTGTT CCTCTACTGG 198 0 

AGCTTGAGCC AGACAGCGCT GCCCAGGGGT 2 04 0 

CTGTACCTGG CACGGCCCAT CCTGGACTGA 2100 

CTCTGCTGCC TTGGTGCCCC TGTGTCTGCC 2160 

TTCCATCCNT GTCCCrGTN'T GGCCTTAACC 22 2 0 

2222 

Stop Codon 
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hs FATP1 



FIG. 



4A 



ATG CGGGCT C 
G GG CT G C C 3T 
TGGCGCTTGC 
GATC 

CGGGrtGGCCT* 

TGTAATGCTG 

GCTGCCTGGG 

GACGCTGTCC 

AGCTG 

AGCXGGGGGG 

AGCGACAGCC 

CGCAGC 

CGGG CGA C G T 

GGCTGGC CAA 

TGGCCT'T' CTG 

CGQCjTOPiGQ C 

CGGGGAGTCT 

CACGGT 

TCCGGCGGTG 

TCTGGGGAAA 

GGACACCCAC 

CCCCAGCAAG 

GCCCCACCCC 

ACCACCGGGC 

AATGCCCTCA 
i-^^ 

TTGCTCTG'TG 
ACI'GCTT'GGG 
XXXXXXXXGG 
ACCGCATGGG 
ACTGCCTGCG 
vjG GG G G\j vj A\_ 

v,T\„A i * r^T 

CGACTGCATG 
GAGCTTTGTG 
TGCAAGCT*-. C 
AGAAAAT AA G 
NTGTTTGGAA 
CNGNGT 
GGTTTTT GAC 
G. x^^^w^. • . - 
- AG/-VJ ^ ^.-1. ^ — 
AG G G T G G G.A-. 

NG X GA.T GT G G 



wvjGG X GGGGG 
GGAGGTGGAG 
- GGGGATCGT 
- GGGGG CGGG 
T G GAGG TGGA 
CGTGGTCACT 
TGTCAGCGGG 
CTCCGTCCTC 
CCACCAGCGT 
GCACCAGCGT 
G\jAGCGGCTG 
CGGCTACTCC 
\jvjTGGCCATC 
GGCGGG CATG 

cctgggcacc 
caggcgtggg 

GCTGCGCCCC 
GTATGCCCCG 
ACCATGACCC 
AGTTTGATCA 

gtcctggacc 
ggcatggacg 
ctaacactgt 
tgcccaaggc 

GCCGCTGAGA 
TGTCCATGTT 
TGGAGACCAG 
GCTGGACAAA 

AGCGTTCGGC 
CGTGTACCAC 
CGCTTACCAA 
vjvjGCTGACAG 
.=iAGTACAACT 
CZZGGCChGG 
CAGCTTCGTT 
- GATTTCTTG 
AAAAAAAAAA 
Is'NCTTGCANA 
GGTCTATTTA 

- TGGCTTACT 
<^ - j-iG — ^ GGGA 
^ vj.-iGGT'A 
rG.-.CGrMGGN 
GGGGANTTTT 



GGGGGGG1 
*- GCGGv_.rtG GG 
CTGCAAG 
GCTGAGGGCT 
GAGTGAXXXX 
G AGAGAT C AG 
AGGCTGAGGG 
CCTCCCAGCG 
GCCGGCCAGG 
GCCGGCCAGA 
GCGCT 

AATGCGGTAG 

GAGGCCGCGC 
TCGGGCGCTA 
CATCAGGTGG 
AGGCCTGGGA 
GGCAGGGAGT 
ATGTGTTGGG 
AGTTCTGCTC 
CGCTGCTGAA 
GTGAGTCAAC 
ATCTCCTGCA 
TGCCATTGTC 
GTGACCCAGG 
AC C C^GGGG A 
GGCCAAGCGG 
GG C AT C A C A. C 
CCAGCCTGTG 
CACCACGCCr 
TCGGCAGGTA 
GGCCACCCrC 
TCGTCCTCGG 
GCACGCCrrG 
GCCCAGCCG7 
ATTTGTGTTT 

GGCTGCTrrr 

TAGGNAAATT 
GATTTGGG G G 
GATCAGAT A G 
A GAGGGGGA.G 
GGAACGTGTG 

- - AC.-iGGCA. "Z 

- -t ACTA G ATT 



WW ^ X GGTCTCGC TG 



GCGAGGCGAG 
C TGGGGG C C C 
>LXXXXATACG 
CACAAAGTTC 
TCCAGAGGCC 
GTCTCTCTGT 
GTCTCTCTGT 
CCATCCCGCG 
ATGCCGGGAC 
CCAACCTCTT 
GCCGGCCGGA 
* TGCTCAACGT 
AGGCCCTGAT 
GCGGGGACCC 
AGGCGGCCGC 
TGGTGCATCC 
GACCACAGCG 
TGGAGACTTG 
GGAGGCCTCT 
TTCCAGGACA 
GATCGTCTTT 
GTGCACAGCA 
CATCTTGCCA 
CAGAGAGGGC 
- GGACGTGTA 
GATTTTCACA 

^rr^ ^^^^ ^ 
\- w • iw^wVJVUw ^ v_ 

ACCGCATGCA 
GTACGGGCTG 
TGCAGGAAAC 
CAAGAAATTC 
GGIJNTCCGAA 
TTTATCGGTG 
GCAGCGGAAT 
TGTTAGAANN 
! 'GATTAG AT A 
GGTATG 

w - w^J^J^J i. 



- _ G/nGATGGA 



T GG CG CTGTT 
TGTACGTGGG 
ACCTCTTGTG 
ACGCTGCAGG 
CTGAGATCTA 
ACATCGCCTC 
AGGCGGGGCA 
GCTGATCCGC 
GCTGATCCGC 
CATCTTTCAG 
CGGCGAGTGC 
CCGCCAGCTG 
GTTCGTGGGG 
GAACCTGCGG 
CTTTGGAGGA 
AGGACTGGCC 
CTGGACGTGG 
CAGGCCTCGG 
GTGGCCGAAG 
GGGCCCGAGG 
ACTGCCCCGT 
CCTCTACCCC 
TCTACATCTA 
GGTGAGGGGC 
GCCTGACCTG 
AGCTGGTGTT 
AACTCATTGC 
CCATCTTGGG 
CCCCTCCCCC 
GGCGGCTGAC 
GGTAGGGAAT 
ATCATCGGCG 
TCGGCCAGCC 
AGTGCTGAGA 
CAGCTGCTGG 
CTTCCAGGCT 
GAAGTTCTAC 
CTNACTNGCA 
TGGCACATCT 
NTAGCTTGTW 
GTCCCCAG 
T CAAG AG ATT 
v-j X v_TGGCAAA 
GTCTGGAACT 
A\_AGGCATGA 

gtctnggt::t 



-■^--Gg::gg t:"^aagc tzcgcctczz gggt::tacc 



. _ _ ^ 



X >J X VJ w w^. 



•-j-GGC*GCTG 60 
CAGCGGCGGC 120 
rtGTGTTGCGG 180 
GCTGGGCTTG 24 0 
CTCTCTGCTG 3 00 
ATGAAAGCCT 360 
GGGCACCAGT 420 
GTGCGCCTGG 4 80 
GTGCGCCTGG 54 0 
GCGGTAGTGC 6 00 
TGGACCTTTG 660 
GGCTTCGCGC 720 
V — ^ X GGCTGG 7 80 
CGCGAGCCCC 84 0 
GAAATGGTGG 900 
CCTGGGGGGG 960 
GCATGAGGTG 1020 
GAGGGGCCTG 108 0 
TGAGCGGGCA 1140 
GCATCTTGCC 1200 
TGGCACAGAT 1260 
AATGACTCAG 1320 
CACGTCGGGG 13 80 
CCACAGGCAT 1440 
CCCCTCAGCT 1500 
TCCTGAGCAC 1560 
AAGGGTCATA 16 20 
ACCXXXXXXX 1680 
TGCAGGTACT 1740 
GTGCTGTATG 18 00 
\j:\jT GGGTGGG 1860 
TGGGGCAGTG 192 0 
GCTTCTGGGA 198 0 
GMACAGGCGT 2 04 0 
CATCCGGTGC 2100 
GTTCTTTGTT 2160 
CGAGGGTNNC 2 22 0 
TAATANGNGA 22 8 0 
GGTCGTCACC 23 4 0 
TGTAAAAATC 2 4 00 
TGGAGTACAA 2 4 60 
2520 
2580 
2640 
2700 
2760 
2820 
2B80 



— . -VJV.^. w 



-wJXiVJX 
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NTAATTTTTT 
ATCTCCTGAC 
AGCCACCCCG 
GGGTGATTTN 
TGGCCCAATT 
GCAGCCAGTG 
NTCAGCTTCA 
TNGTCAGNAC 
GCTGAGACTG 
AGCATGTGCA 
GCAGGGGCGG 
TGTGAGCGGG 
CCTGGCANGC 
TTCTGCCACT 
TTCTCCATCT 
NTNTTAAAGG 
GAATGTGGCA 
GCACCTGGGG 
CCAGGGGTGA 
CTCCAGGCTG 
CTGCTGAGGC 
CACGAAGGGG 
GCATCCGGCC 
GGAGGATTGC 
TCTGTACAAA 
AAGGCACCCC 
GGGCAGGGGG 
AAGAGACTGA 
AGCCATCATC 
CGGGGTNTCT 
CACCCCGTCC 
CCGGTGCGCG 
CCTGCCATCT 
GGCGCCACCG 
CCCCGGGGCA 
ACATAAAACA 
GATCTGGACT 
TTCACACAGA 
CCAGAGGTCT 
GAGGCGACGC 
AGGACCCCGC 
GGTTTCAACA 
GACACAATGG 
AG CAGGG C C C 
GCAGCCTGAA 
.SlACAGGACCG 
^ CG C C G--\C AG 
GCC CGGTG A 



GrcATTrrrTA 
>:tngtgatcc 

CC1*GGCCG - G 

CCAGGTTrJTG 

NTGNGAATGG 

A X Gl"!* GGG "TG 

GAGCATGGCG 

NNTCTTCTGG 

AAGAGTTCAG 

AAGTCCCrGA 

CAGGGCCAGA 

AAGCCCCGAG 

NANGGNTTCA 

TCCCAGCTCN 

GTAAAGTGGC 

CACCGTGGGG 

GGGAGGGAAG 

NTCAGTCCAG 

CGCTCCGGCC 

AGNGTTGCCT 

TCAGACAGGG 

TGCGGGTACG 

AGGCACAAGT 

TTGAGGCCAG 

AAAATCTGTA 

GTTAGAGGCT 

AGACCCCCCG 

ACGAGT C CT A 

ATCCTTAGGC 

ACCTC 

CCCAGGTGGT 

AGGCGGAGAG 

G GGAGG AG T T 

AGTGCAACTG 

GoT'CT'GGGAG 

GCCXGGACTG 

w i ^J^wACAC 

GGATCTACTT 
TCTCTTGACT 
TTACTACCCT 
Ax AXGC GG 

vjv— v_\-;\_rAj. 

-■^vjv— A - GC*o 

■ G \- i ^>;vJ i v_ 

^.ATGGC CTTC 



GTAGAGACGG 

CCCATTCTTA 
TCGGTAGAGA 
GGGNGGGGAC 
CATGCATTTC 
GTGGTGTCTG 
GGGANNAAGG 
GCAAANGCAT 
GGAGGCAGAG 
TCCAGTTGGC 
CACANTTGCA 
TTCCCATTNT 
GTGATCTCGT 
GGGGGGGGAG 
CACCTAGTGG 
ATGTCCTGCA 
ACGCAGCGCA 
AGCCCAGGCT 
TTNGGGGTGC 
AGTGGTATCT 
GGGAGGAGGA 
GGCTCACACC 
GAGTTCAAGA 
GTCCCAGCTA 
GCAGTGAGCT 
CCNAAAAACA 
GGAGTCAATG 
TGTTCCGCTC 
CGGGCTCCCC 
TCAGTACATC 
GCGACACCGC 
CACGGAGCGC 
CAGCATTGCC 
TTCAGGGAAG 
GCGCGGAAGG 
AGAACTCTGA 
CCTGCCAGAC 
GAATGCAGGC 
GCTTTTGCAG 
CTTTCCTACT 
GCCCCACGTG 
vjGAXGCCCAG 
w s-CACCCGGA 
T C C GT AGGGG 

- _GGr-vxGG\ 

_-\Gvj \„ G v-i Gj-v 



GGTTTNACCA 
NCNTCCG.AAA 
AGCCAGTCAT 
TNTTGTTGGG 
CTTCCCCCCC 
TTNTCTGGGC 
CTGATCATAT 
GGGNNCAGGG 
CTGGGGGAAG 
TGTTCAGCGG 
CACAGGTAGA 
GCTGNTATGA 
GGGGGCCGGG 
TCCAGTTCCT 
TCATTTCCGA 
ATGACCCCAG 
GGGGGAGCCC 
GCCTAGTGGG 
CAGCCAGGGG 
CAGNTCTCCT 
CCAGGAAGCT 
AAGACACTGC 
TGGAATCTCA 
CCAGCCTGGG 
CTCGGGAGGC 
GTGATCATGC 
ACAAAAAAAG 
TGGTCCCTAA 
ATAGGATTAG 
ACCGCCTGCC 
GGGGAGATCT 
GTGCGCCTGG 
TTCGGCGTAC 
AACATGGACG 
ACACTTGTCT 
CTCGCAAGGC 
GXXXXXXXXX 
TGAGGTTTGT 
TGGGAAGGTG 
ATCAGCCAAG 
CAGTTCACCC 
TACCCGATCC 
GGGCTGT'GC/-i 
G C AGGGGTC C 
AGCCTGGCCT 
ATG T C.— .G C G.A 



TGTTAGCCAG 
GTGCTGGGAT 
ATTGGCNTGA 
TGAACCANAC 
TMNGACCAGG 
TTCAGTGCTT 
TGGTTTAGTC 
GTGCTCfcAG 
ACTGCTTCAG 
AGATGGAGGG 
AATGTGGGTT 
TCCCATCTGG 
ATGCCTGGGT 
TACCATCTGT 
AGCCCAACTG 
GAAGNTCTTG 
ATGAGCAGGT 
TGCACCTGGG 
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,-.CACCZTCCO CTGGCGAGGG GAGAACGTCT CCACCACCGA GGTGGAGGGC GTGCTGAGCC 588 0 
GCCTGCTGGG CCAGACAGAC GTGGCCGTCT ATGGGGTGGC TGTTCCAGGC AAGCTGGGGT 5 94 0 
TGCAGGGGGT GGTCCTGAGG CATGGTCCTG AGGGAGCTCA GCCAAAAGGG GCTTAGAGGT 60 0 0 
ACACATGCCT TTGGCAGTGC ACAACCTGGA CAACTGCTCA TGGCAGCCCA GGAGGAAGCA 6060 
CTGGATCTGG AGCCAGTTCA CCTGGGTGAT GTTGAGCCTC AGTTTTGTCA TCAGAAAATG 6120 
GGATCATGAA AGCCCACCTG TATTAGGGCT TCAATGAGCC AAGCAGGAGC TCCCCAAAAT 618 0 
GTGTGGCTGC TTCCATAAAT GTCATCCCAG GTTGGGAGAG ACTGGAGATT ACAGACCTGC 624 0 
TACTGCTTGA CAGTGTATCT GGTCCTGCTG GTGAGGATGA GAGGCGGGGT GTCCTCAGCT 6300 
CCTCTGCCTC CAGGAGTGGA GGGTAAGGCA GGGATGGCGG CCGTCGCAGA CCCCCACAGC 6.3 60 
CTGCTGGACC CCAACGCCAT ATACCAGGAG CTGCAGAAGG TGCTGGCACC CTATGCCCGG 6420 
CCCATCTTCC TGCGCCTCCT GCCCCAGGTG GACACCACAG GTGCGAGTCT CCCCCACTCC 64 8 0 
AACCTCTCTC TTCATCCATC AGTGTGTCTG TTGATTCGAG GGATATTGAG TTGAGGCCTC 654 0 
CAGAAGCCAC CTGCTCAGCC CTTATCTGCC CCCCATCCCC ACTATAGGCA CCTTCAAGAT 6600 
CCAGAAGACG AGGCTGCAGC GAGAGGGCTT TGACCCACGC CAGACCTCAG ACCGGCTCTT 6660 
CTTCCTGGAC CTGAAGCAGG GCCACTACCT GCCCTTAAAT GAGGCAGTCT ACACTCGCAT 672 0 
CTGCTCGGGC GCCTTCGCCC TCTGA 674 5 
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FIG. 8B 

Intron 1 

GGCGAGACCTCTT— ? — CGGTCTCTCTGT 

Intron 2 

AAATGGTGGCGGgtgaggcc— 215— gaccacagCGGTGGCCGAAG 

Intron 3 

AGGGCATGGACGgtgagtca 1 31 tcctgcagATCGTCTTTTCT 

Intron 4 

CGTGCACAGCAGgtgagggg-1500"CCCCtgcagGTACTACCGCAT 

Intron 5 

ACCACTCGGCAGgtactacg 68 ctctgcagGAAACATCATCG 

Intron 6 

TACAACTGCACG >3kb cgtcccccacGTCGGCTCCTGT 

Intron 7 

TGGACGGCAAGgtgcacacc — 3000 — cattccagGTCGGCTCCTGT 

Intron 8 

CCCTGCCAGGCCgtgagcag— 2900— ctccctagGGGAGCCTGGCC 

Intron 9 

CCTACCTCTCAGgtgcgcag 90 tctgccagGTGACGTGCTAG 

Intron 10 

TGGCTGTTCCAGgtcaagct — 389 — gcctccagGAGTGGAGGGTA 

Intron 1 1 

TGGACACCACAGgtgcgagt — 2800—cactatagGCACCTTCAAGA 
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